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General  Rational  &  Significance: 

The  classification  of  signals  from  the  surrounding  world  is  one  of  the  most  important, 
complex,  and,  at  the  same  time,  least  understood  capabilities  of  the  mammalian  brain. 

Traditionally,  attempts  to  comprehend  categorizadonal  mechanisms  have  utilized  the  physical 
depiction  of  the  incoming  signal,  e.g..  the  spectrogram  of  a  sound  or  its  waveform,  as  the  starting 
point  for  an  analysis.  Elxtensive  effons  to  unravel  'invariant  features'  in  speech,  the  most 
complexly  categorizable  signal,  have  largely  failed  to  disclose  reliable  and  valid  common  features 
in  the  physical  representation  of  the  signal  that  can  account  for  the  remarkable  robustness  of  the 
human  speech  categorizer  in  the  presence  of  speaker  variability,  noise  interference,  and  partial 
signal  omissions. 

An  alternative  approach  to  elucidate  fundamental  principles  underlying  signal  classificauon 
utihzcs  the  biologically  nrcproccssed  representation  of  signals  in  the  mammalian  birain.  This 
approach  nukes  u.sc  of  biologically  realized  and  physiologically  dcfiiuble  processing  strategies 
ajHl  resultant  signal  represenutions  in  the  CNS  tlut  undoubtedly  contribute  to  the  robustness  of 
the  signal  categonzer/classifier  in  numnuls. 

A  necessary  condition  to  achieve  this  goal  is  the  existence  of  and  the  access  to  a  complete 
and  systematic  biological  representaUQO  of  the  input  continuum.  So  far,  only  two  suges  along  the 
auditory  pathway  have  been  explored  witn  enough  detail  to  provide  such  a  biological 
rcpteseniauon  ol  the  input:  the  auditory  nerve  and  the  primary  auditory  cortex  (AI).  Whereas  there 
exist  a  multitude  of  isolated  obscivauons  in  the  intervening  stations,  only  in  the  two  before 
mentioned  lo  cls  have  there  been  attempts  to  reconstruct  a  complete  distributed  ensemble 
representation  for  test  stimuli.  Althou^  the  systenude  analysis  of  processing  properties  is  much 
more  advanced  in  the  relatively  unifunnly  organized  auditory  nerve,  the  more  complex  signal 
representation  in  the  auditory  cortex  provides  a  much  more  suitable  level  for  elucidating  the 
foundauons  of  bic^ogical  signal  classification.  The  primary  reason  for  this  assessment  is  that  more 
complex  processing  strategies,  e.g.  noise  reduction,  level  tolerance,  and  feature  enhancement, 
have  to  be  accomplished  mainly  beyond  the  level  of  the  auditory  nerve.  In  addition,  recent  studies 
of  the  cat  pnrmry  auditory  cortex  in  our  laboratory  and  elsewhere  clearly  indicate  that  the  form  of 
signal  representation  in  this  cortical  field  is  that  of  a  generalized  and  possibly  uncategorical 
represenution  not  unlike  that  of  the  primary  visual  cortex  and  in  contrast  to  a  speci^zed,  feature 
ctWiination  sensitive,  categorical  representation  that  has  been  shown  for  a  number  of  subfields 
surrounding  the  bat  primary  auditory  cortex. 

The  signal  encoding  and  representation  in  the  primary  auditory  cortex  of  cats  and  owl 
monkeys,  the  only  species  tliai !  .tve  been  studied  with  sufficient  detail  besides  the  bat,  shows  a 
number  of  systematic  and  spe*  a"y  distributed  features  that  may  provide  robustly  enhanced 
properties  that  form  the  basis  for  a  tolerant  signal  classification.  Other  auditory  cortical  fields, 
surrounding  AI.  appear  to  have  slightly  (AAF)  or  grossly  (AH)  different  response  characteristics 
from  those  seen  in  AI.  Biologically,  only  parallel  and  cooperative  processing  may  lead  to  the 
desired  signal  analysis  and  categotizadon.  For  our  analydeal  attempts,  it  is  similarly  likely  that 
only  a  combinadon  of  infomiation  from  all  cortical  fields  may  provide  us  with  all  necessary  cues 
for  a  n  animals  strategies  in  signal  categorization.  Nevertheless,  it  is  hypothesized  that  the 
represenution  of  acoustical  signals  in  the  primary  auditory  cortex  is  an  ideal  starting  point  to 
explore  the  c.'ics  available  for  sign.'  ’  caiegorization  after  extensive  biological  'prc'-processing. 
Based  on  a  systematic  cortical  ma’  ,  of  signals,  the  strategies  that  lead  to  such  a  represenution 
as  well  as  their  consequences  for  lection  of  classification  algorithms  for  speech  signals  and 
sonar  signals  can  be  explored. 

Thc.se  studies  attempted  to  examine  the  representation  of  acoustical  signals  in  the  primary 
auditory  cortex  of  mammals  and  to  derive  detection  and  classification  strategics  based  on  the 
biological  signal  represenution  of  complex  signals  in  the  CNS.  We  postula^  that  the 
subcortical/cortical  processing  of  signals  results  in  a  systematic  and  robust  signal  represenution  in 
the  primary  auditory  cortex  that  is  more  suitable  for  signal  detection  and  classification  schemes 
than  acoustical/spcctral  signal  rcprescntaiions.  Models  of  central  nervous  representations  were  to 
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be  developed  and  applied  to  the  transformation  and  classification  of  complex  signals.  The  general 
objectives  for  the  research  conducted  in  the  last  funding  period  were  as  follows: 

Objective  1:  To  explore  the  physiological  principles  undci  lying  the  spatio-temporal 
repre.<^ntation  of  simple  and  complex  signals  in  the  primary  auditory  cortex; 

Objective  2;  To  create  a  formalized  modcl/simulation  of  the  cortical  representation  of  complex 
signals  based  on  fundamental  principles  of  spatio-temporal  processing; 

Objective  3:  To  utilize  the  formali^  cortical  representation  as  the  input  (front  end)  to  a  self- 
organizing  signal  classifier  and  as  training  pattern  for  the  output  of  a  dynamic  neural  network. 

In  the  following  section,  the  progress  toward  these  objectives  over  the  last  funding  period  will  be 
briefly  reviewed, 


Objective  1:  In  order  to  explore  in  more  detail  the  physiological  principles  underlying  the 
spatio-temporal  representation  of  simple  and  complex  signals  in  tlic  primary  auditory  cortex  wc 
completed  a  series  of  experiments  that  utilized  narrow-  ^  broad-band  signals  to  de^ve  local 
filter  properties  of  single  cortical  neurons  and  small  groups  of  cortical  neurons.  The  underlying 
assumption  is  th.:t  the  characterization  of  local  filter  prop^cs  is  ncccssaiy  and  sufficient  to 
predict  the  cortical  neuronal  response  and  spatial  response  distribution  for  arbitrary  acoustical 
signals. 

Narrow-band  stimuli  and  frcQucncv  modulatjon  (in  collaboration  with  Julie  Mcndclson  and 
Mitchel  SutterV  The  spatial  distribution  of  several  response  parameters  to  pure  tone  stimulation 
(threshold,  best  level,  dynamic  range,  monotonicity:  Schreiner  et  al.,  1992)  and  frequency 
sweeps  was  evaluated  (Mcndclson  ct  al.,  1993).  In  addition,  wc  obtained  and  compared  the 
topographical  distribution  of  several  aspect  of  pure-tone  receptive  fields,  namely  best  frequency, 
excitatory  bandwidth,  response  threshold,  and  monoionicity  of  rate/lcvei  functions  (Schreiner  and 
Sutter.  1^2,  see  Appendix;  Sutter  ar>d  Schreiner  (in  press)  see  Appendix)  for  single  and  multiple 
unit  recordings  in  the  same  animals.  The  purpose  of  this  comparison  was  to  guide  the 
interpretation  of  multiple  unit  mapping  experiments  by  relating  spadaliy  integrated  response 
measures  to  the  propenies  of  their  generators.  Among  the  main  findings  of  these  studies  are: 

a)  All  studied  parameter  show  a  nonuniform  spatial  distribution  along  the  isofrequency  axis  of  cat 
auditory  cortex. 

b)  Several  of  these  parameters,  especially  FM  diiecdon  and  FM  speed  were  correlated  ’vith  the 
spadal  distribudon  of  the  excitatory  bandwidth  suggcsdvc  of  a  close  reiadonship  between  some 
dynamic  receptive  field  aspeas  and  stadc  receptive  field  properties.  Otlicr  parameters,  c.g. 
threshold  and  excitatory  bandwidth,  appear  to  be  indepervdcntly  organized. 

c)  In  dorsal  Al,  single  unit  and  multiple  unit  properties  were  similar  with  regard  to  the  baiKlwidth 
aeasures  but  different  for  the  intensity  parameters.  In  ventral  Al,  both  measures  showed  similar 

results  for  intensity  coding  parameters  but  differences  between  the  multiple  and  single  unit 
bandwidth  organizations.  Tbese  findings  provide  further  evidence  that  the  dorsal  and  ventral  half 
of  Al  process  information  differently.  Kffcrenccs  between  single  unit  mapping  and  multiple  unit 
mapping  could  be  accounted  for  by  differences  in  the  local  CF  scatter  across  Al  and  differences  in 
the  local  scatter  of  response  threshold.  In  addition,  the  maximum  response  strength  of  single 
neurons  to  pure  tones  varies  in  Al  with  nonmonotonic  neurons  usually  contributing  more  spikes, 
at  best  level,  than  monotonic  neurons. 

d)  One  region  in  the  center  of  the  dorsal-ventral  extent  of  ca  Al  appears  to  have  a  constellation  of 
parameters  that  may  subserve  a  special  signal  detection  purpose.  Near  the  border  between  ventral 
and  dorsal  Al,  neuronal  recordings  arc  shaipiy  raned  for  frequency  (high  Q  values)  and  for 
amplitude  (high  degree  of  nonmonotonicity).  In  addition,  the  frequency  scatter  and  threshold 
scatter  is  minimal  and  the  neurons  show  the  lowest  response  thresholds,  lliis  constellation 
fulfills  the  conditions  to  serve  as  a  region  for  fine  spectial/atnplitudc  filtering  which  will  only 
respond  to  frequency-banded  components  with  intensities  just  above  the  animals  threshold.  Since 
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it  has  been  shown  that  addition  of  background  noise  will  shift  the  rate/level  function  of 
nonmonotonic  neurons  to  higher  levels,  this  region  maintains  its  ability  to  detect  a  signal  close  to 
threshold  even  in  the  presence  of  background  noise  (see  also  below). 

e)  A  study  of  tire  spatial  OTganization  of  the  primary  auditory  cortex  in  owl  monkeys  (Recanzone 
and  Schreiner,  in  preparation)  revealed  similar  principles  as  seen  in  the  cat,  i.e.  bandwidth, 
threshold,  and  latency  gradients  along  the  isofr^uency  axis  as  well  as  well  expressed,  elongated 
binaural  interaction  patches  (or  bands).  These  findings  suggest,  that  the  organization  of  AI  seen  in 
the  cat  reflects  general  organizational  principles  including  that  of  human  auditory  cortex. 


Narrow-band  stimuli  are  a  versatile  tool  to  easily  characterize  neuronal  receptive  fields,  since  the 
stimulus  is  chosen  to  be  clearly  definable  regarding  temporal,  frequency,  and  intensity 
characteristics.  Resulting  temporal,  frequency  and  intensity  transfer  function  for  a  receptive  field 
usually  keep  two  of  these  aspects  constant  and  vary  tlie  third  one.  Naturally  occurring  stimuli, 
however,  usually  consist  of  a  wide  range  of  temporal,  frequency  and  intensity  combinations  that 
render  it  extremely  difficult,  if  not  impossible,  to  predict  cortical  neuronal  responses  based  on  the 
narrow-band  transfer  functions  accumulated  in  frequency  response  areas.  For  example,  the  strong 
level  dependence  of  the  shape  of  a  frequency  transfer  function  (a  cross  section  through  the 
frequency  response  area)  is  principally  different  from  a  simple  FIR  filter  that  can  be  applied  to  a 
complex  sign^  at  any  level.  To  what  degree  c^n  frequency  response  areas  indeed  account  for  the 
response  of  complex,  broadband  signals?  In  a  first  attempt  to  address  this  question,  we  developed 
a  stimulus  that  ultimately  should  enable  us  to  provide  an  answer.  The  chosen  stimulus  consist  of 
a  broadband  signal,  either  noise  or  a  harmonic  scries,  whose  spectral  envelope  is  sinusoidally 
modulated  (on  a  logarithmic  frequency  scale  to  make  the  envelope  features  largely  frequency  shift 
invariant).  Parameters  of  the  spectral  envelope  are  'frequency':  ripple  density  measured  in 
l/octavc;  'amplitude':  modulation  depth  measured  in  dB;  and  'phase';  frequency  position  of 
maxima  or  minima  of  the  spectral  envelope,  measured  in  degree.  These  'ripple'  spectra  are 
essentially  the  same  a.s  sinusoidal  luminance  gratings  used  for  the  investigation  of  the  visual 
system.  Their  relevance  for  the  auditory  system  is  threefold.  First,  they  sdlow  to  study  directly, 
systematically,  and  parametrically  propatics  of  cortical  cells  in  response  to  a  broadbamd,  complex 
signal.  Second,  their  form  resembles  closely  an  important  aspects  of  ail  vocalizations,  namely 
vocal  tract  resonances  that  result  in  spectral  peaks  or  formants  in  human  speech  as  well  as  in 
animal  vocalizations.  Third,  the  sinusoidal  nature  of  the  spectral  envelope  allows,  with  some 
caveats,  a  system  theoretical  approach  that  car  provide  an  estimate  of  tlx  frequency  response  area 
based  on  a  broadband  stimulus  (for  details  see  below). 

By  studying  responses  to  different  spectral  envelope  frequencies,  wc  obtained  information 
regarding  the  processing  of  elementary  signals  that  may  lead  to  the  understanding  of  the 
processing  of  arbitrary  spectral  envelope  waveforms,  including  formants  in  voc^zations,  and 
'spectral  edges'  such  as  notches  due  to  resonances  in  the  outer  ear.  The  spectra  with  sinusoidal 
envelopes  or  'ripple  spectra'  applied  in  this  study  have  the  following  chai^teristics:  the  carrier 
consists  of  a  harmonic  series  (fo  ranging  from  50  to  2(X)  Hz)  with  a  6dB/octave  decline  of  the 
component  amplitudes  ( 1 20  to  255  components);  the  bandwidth  of  the  stimulus  is  usually  3 
octaves;  the  spectral  envelope  of  the  signal  is  represented  by  a  sinusoid  on  a  logarithmici^y 
scaled  frequency  axis  (since  the  basilar  membrane  essentially  ptx)vidcs  a  receptor  surface  with  a 
logarithmic  frequency  distribution);  the  frequency  of  the  envelope  sinusoid  is  referred  to  as  ripple 
density  (ripplcs/octaves);  the  modulation  depth  of  the  envelope  (ripple  depth)  is  linear  on  a  dB 
scale. 

In  this  study,  the  geomeuical  center  of  the  barrd-limited  signal  was  always  at  a  maximum  of 
the  sinusoidal  spectral  envelope.  The  center  was  positioned  at  the  CF  of  each  cortical  neuron  and 
the  ripple  density  or  the  frequency  distance  between  spectral  peaks  was  systematically  varied. 

The  resulting  'ripple  transfer  function'  was  reconstructed  for  different  m^ulation  depth  and 
overall  intensities.  For  the  majority  of  neurons,  the  ripple  transfer  function  was  a  bandpass  and  a 
'best'  ripple  density  could  be  defined.  The  remaining  transfer  f  jnetions  appeared  to  have  a 
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iowpass  characteristic,  at  least  for  those  ripple  densities  employed  in  this  study  (0.3  to  8 
ripples/octave).  Best  ripple  densities  rang^  from  0.3  to  4  lipple/octave  with  a  mean  around  1 
ripple/octavc. 

The  spacing  of  formants  in  human  speech  can  be  expressed  in  ripple  density  by  calculating 
the  reciprocal  of  the  spectral  distance  between  two  formants  measured  in  octaves  [e.g.  1/  (2 
octaves)  corresponds  to  a  ripple  density  of  0.5].  Most  formant  spacings  in  human  vowels  arc  in 
the  range  of  0.6  to  1  ripple/octave  with  the  majority  below  3  ripples/  octave.  Examination  of  the 
ripple  densities  present  in  the  vocalization  of  cats  ^so  shows  a  prominent  peak  around  a  ripple 
density  of  1  ripple/octave. 

These  findings  suggests  that  neurons  in  the  auditory  cortex  can  serve  as  filters  for  different 
spatial  frequencies  or  spectral  envelope  frequencies  and  that  the  fUteis  are  {^ominandy  tuned  to 
spectral  envelope  features  and  frequencies  that  are  prominent  in  their  behaviorally  relevant 
acoustic  environment. 

Spatial  mapping  of  responses  to  ripple  spectra  along  the  isoi^uency  domain  of  A1 
revealed  a  systematic  .>hift  of  the  best  ripple  density  in  multiple  unit  responses  from  central  A1  to 
dorsal  Al.  Tbe  shift  paralleled  the  previously  described  variation  of  integrated  excitatory 
bandwidth  with  sharply  tuned  locations  and  high  best  ripple  densities  near  the  dorso-ventral 
center  and  broader  tuning  curves  witii  lower  best  ripple  densities  toward  the  dorsal  end  of  AI. 
Ventral  AI  showed  a  less  systematic  distribution  of  ripple  densities. 

The  findings  indicate  that  the  auditory  cortex  is  well  suited  for  the  analysis  of  complex, 
broad-band  stimuli.  The  results  support  the  hypothesis  of  a  systematic  'spatial  frequency  '  or 
'spectral  envelope  frequency'  representation  in  AI.  This  representation  is  oriented  orthogonal  to 
the  fr^uency  axis  and,  therefore,  is  available  for  c'^cry  fr^ucncy  channel.  The  representation  is 
covering  the  main  spectral  envelope  frequency  range  found  in  complex  signals  such  as  speech 
and  can  serve  as  an  analyzer  and  classifier  for  arbitrary  spectral  waveforms,  including  important 
classes  such  as  formants  and  spectral  notches.  The  topographic  distribution  of  different  spectral 
envelope  frequencies  across  AI  supports  the  psychophysical  finding  that  separate  independent 
'ripple  frequency  channels'  exist  that  can  be  t^pted  (long  duration  of  stunulation  with  a  given 
ripple  density  will  degrade  the  discrimination  of  that  ripple  frequency  channel  but  not  others; 
Hillicr,  1992)). 

Previously,  we  have  shown  that  the  cortical  organization  of  frequency  representation  cart  be 
altered  through  experience  (see  Recanzonc  ct  al.,  1993).  In  an  attempt  to  demonstrate  that  the 
cortied  representation  of  inf^ormation  within  the  isofrequenev  domain  can  be  adjusted  tlm>ugh 
experienre  as  well,  we  trained  several  cats  to  discriminate  between  two  stimuli  that  possess^  the 
same  spectral  envelope  frequency  but  with  different  locations  of  the  spectral  maxima  (Keeling  ct 
al.  1995a  (submitted)  see  Append).  It  was  hypothesized  that  exposure  to  a  specific  spectral 
envelope  frequency  that  is  made  behavioral  relevant  to  the  animal  would  emphasize  and  refine  the 
cortical  representation  cf  that  envelope  characteristic.  It  was  found  that  some  but  not  all  animals 
were  capable  of  improving  there  ability  to  discriminate  those  sounds  over  the  period  of  several 
month  of  training.  Elcctrophysiologically,  it  was  found  that  the  ripple  transfer  functions  of 
neurons  in  AI  of  trained  cats  were  indeed  different  from  those  in  untrained  control  animals 
(Keeling  ct  al.  1995b  (submitted)  sec  Appendix).  The  peak  of  the  compound  ripple  transfer 
functions  shifted  toward  the  trained  envelope  frequency  and,  in  addition,  an  inenused 
responsiveness  to  higher  spectral  envelope  frequencies  (broadening  of  ripple  transfer  functions) 
was  observed. 

These  findings  suggest,  that  behavioral  training  (not  just  exposure)  to  specific  aspects  of 
complex  sounds  can  result  in  an  improvement  of  the  psychophysical  capacity  of  a  subject  (time 
course:  at  least  four  to  eight  weeks  of  training)  and  iliat  cortical  neurons  can  alter  theii  response 
characteristics  as  a  consequence  of  that  training.  It  is  likely,  that  the  improvement  in  the  cortical 
representation  of  the  trainetl  signal  features  is  a  necessary  condition  for  improvement  of  the 
psychophysical  capacity.  However,  at  this  time  it  is  not  clear  whether  the  <k)scrvcd  cortical 
changes  are  also  sufficient  for  psychophysical  improvement  to  occur. 
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lemporal  andBinaural  Response  Properties  (in  coUaboration  with  Mitchell  Sutter  and  Julie 
Mendelson'):  The  neuronal  responses  to  tones,  broadband  transient  stimuli,  and  frequency- 
modulated  (FM)  sweeps  were  mapped  in  the  primary  auditory  cortex  (AI)  of  barbituratc- 
anaesthetiz^  cats  (Schreiner  et  al.,  1995  (submitted)  sec  Appendix).  The  spatial  distribution  of 
the  final  two  response  parameters  in  this  series  of  experiments,  onset  latency  and  binaural 
response,  were  determined  in  four  cases.  The  functional  relationship  between  these  two 
parameters  and  the  previously  reported  spectral  (Qiodg,  Q40dB’  i^adband  transient  stimulus), 
intensity  (threshold,  strongest  response  level  (SRL),  dynamic  range  and  monotonicity)  and 
temporal  (FM  sweep  direction  and  speed)  parameters  was  subject^  to  both  a  global  and  regional 
analysis.  Onset  latency  responses  were  systematically  distributed  along  the 
dorsoventral/isofrequency  axis  of  AI  such  that  units  with  shorter  latencies  were  located  in  the 
central  region  while  units  with  longer  latencies  were  more  often  found  in  the  dorsal  and  ventral 
portions  of  the  cortex.  As  shown  by  other  investigators,  alternating  bands  or  patches  of  EE  and  El 
units  were  also  distributed  across  the  cortex.  The  results  of  a  point-by-point  analysis  revealed  a 
lack  of  dependency  of  onset  latency  on  binaural  response  type.  However,  when  the  relationship 
between  onset  latency  response  and  the  remaining  10  response  parameters  was  examined,  the 
correlations  with  the  following  parameters  were  found  to  be  statistically  significant:  Q^odB* 
Q4q^,  broadband  transient  response,  strongest  response  level  (SRL),  monotonicity,  and 
preferred  FM  sweep  direction.  These  correlations  suggest  that  units  that  respond  with  shorter 
onset  latencies  arc  more  sharply  tuned,  do  not  respond  well  to  broadband  stimuli,  have  a  higher 
degree  of  nonmonotonicity,  and  prefer  FM  sweeps  that  change  from  a  high  to  a  low  frequency. 
Binaural  response  type  was  found  to  be  significantly  corrcla^  with  broadband  transient 


response,  threshold,  SRL,  dynamic  range,  and  preferred  FM  sweep  speed.  These  correlations 
imply  that  El  neurons  tend  to  have  sharper  tuning,  weaker  responses  to  broadband  stimuli,  lower 
tl.\resholds,  lower  SRLs,  a  narrower  dynaittic  range,  higher  degree  of  nonmonotonicity,  and 
prefer  slower  FM  sweeps  than  EE  units.  The  systematic  distribution  and  the  functional 
relationship  between  these  response  parameters  may  provide  the  representational  basis  for  the 
deuxmon  and  identification  of  specific  features  in  the  artimal’s  natural  cnvironmcnt.Thc  presence 
of  a  number  of  response  paraiKtcrs  that  appear  to  he  systematically  organized  within  Al  may 
provide  a  set  of  criteria  by  which  the  doisovcatral  extent  of  AI  can  be  delineated.  With  the 
exception  of  binaural  response,  changes  in  the  distribution  of  other  response  features,  c.g.,  onset 
latency,  are  gradual  on  a  global  scale,  but  can  be  quite  variable  on  a  local  scale.  However,  t)K.'.y 
may  not  necessarily  provide  an  unequivocal  set  of  criteria  that  would  allow  for  a  clear  demarcation 
of  the  borders  to  areas  neighbouring  dorsal  and  ventral  AI. 

On  a  functional  level,  the  systematic  distribution  of  these  (and  perhaps  other  as  yet  unknown) 

respOTse  parameters  may  reflect  the  representational  basis  for  the  detection  and  identification  of 

specific  features  of  sounds  arising  from  the  natural  environment.  Schreiner  and  Sutter  (1992)  have 

recently  suggested  that  the  region  in  Aid  (da^ial  to  the  Qi0(}b  peak  )  may  be  better  suited  for 

integrative  analysis  of  broadband  stimuli  by  resp<^nding  in  a  relatively  undifferentiated  manner  to 

tones  OT  sprctral  peaks  of  different  frequencies.  ’  his  is  supported  by  the  observation  that  cells  in 

this  subregion  tend  to  be  more  broadly  tuned  Oj  i  to  have  a  high  responsiveness  to  broadband 

stimuli  such  as  clicks.  In  contrast.  AIv  mav  be  better  suited  for  differential  analvsis  of  the  snectral 

■  *  *  ■ 

properties  of  broadband  stimuli  since  units  there  arc  characterized  by  sharp  tuning  which  is 
distributed  over  varying  ranges  of  CFs  and  which  do  not  seem  to  respond  to  broadband  stimuli  as 
well  as  cells  in  dors^  AI. 


Receptive  field  characterization  (in  collaboration  with  Dr.  Xioaquin  Wang):  One  of  the 
main  problems  encountered  in  constructing  filters  to  model  the  function  of  cortical  neurons  is 
that  it  is  extremely  difficult  to  predict  the  response  of  neurons  to  complex  signals  because  the 
exact  timing  of  an  action  potential  occurrence  depends  on  a  number  of  parameters  that  have  not 
been  well  described,  such  as  the  frequency  dependent  latency  profile  in  the  excitatory  portion 
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of  the  tuning  ciuve  or  the  influence  of  inhibitory  influences  on  tlic  arrival  tune  of  spikes. 
Accordingly,  it  is  impossible  to  precisely  assign  spikes  in  the  response  to  a  complex  signal  to  a 
certain  event  in  the  stimulus.  The  techniqi.c  of  reverse  correlation,  however,  provides  a  means 
to  obtain  a  probability  measiue  of  which  signal  portion  produced  a  given  spike. 

The  technique  we  developed  is  similar  to  ^e  reverse-correlation  metliod  used  by  Jones 
and  Palmer  (1987)  in  the  visui  system  and  by  Eggermont  et  al.  (1983)  in  the  auditory 
periphery.  However,  the  approach  had  to  be  somewhat  modified.  In  the  peripheral  auditory 
system,  Ae  reverse  correlation  method  averages  the  signal  waveform  preceding  every  neuronal 
event  during  a  long  period  of  continuous  stimulation  with  either  complex  signis  or  noise 
amplitude  modulated  with  a  lowpass  noise.  The  result  is  a  pre-event  stimulus  ensemble  (PESE) 
reflecting  the  probability  distribution  in  the  stimulus  that  precedes  a  neuronal  event  (spike).  In 
the  periphery,  the  phaselocking  of  the  response  to  the  stimulus  waveform  is  a  cmdaJ  for 
successfully  deriving  the  PESE.  The  timing  of  cortical  spikes,  however,  is  too  imprecise  and 
does  not  phaselock  to  the  carrier  component  of  stimuli.  A  direct  reverse  correlation  approach  to 
the  raw  stimulus  waveform  would  result  in  a  flat  PESE  due  to  the  temporal  jitter  of  the  spike 
occurrence.  By  contrast,  if  the  reverse  correlation  is  done  on  the  basis  of  the  spectral  and 
temporal  envelopes  of  the  complex  (natural)  sounds,  it  will  be  successful  in  characterizing  the 
spectro-temporal  receptive  field  of  neurons  since  the  time  jitter  in  cortical  responses  is  small 
compared  to  the  slow  temporal  events  the  cortex  is  able  to  follow  (e.g.  Schreiner  an  Urbas, 
1988). 

Figures  1  and  2  give  two  examples  of  the  result  of  this  modified  reverse  correlation 
method.  A  naiTowly  tuned  (frequency  response  area  in  Fig  ID)  and  a  more  broadly  tuned 
neuron  (Fig.  2D)  were  studied.  The  spike  arrival  times  during  stimulation  with  pure  tones  and 
a  set  of  natural  vocalizations  were  obtained.  The  PESE  was  reconstructed  by  first  constructing 
a  128-channci  envelope/time  representation  of  the  signals,  i.e.  each  signal  was  split  into  128 
narrow  frequency  bands  and  the  temporal  envelope  for  each  band  was  obtained  by  lowpass 
filtering  each  band  (cutoff;  2(K)Hz,  well  above  the  normal  cutioff  frequency  for  cortical 
neuions).  Then,  the  signal  content  of  each  band  was  accumulated  for  a  30  msec  period 
inunediatcly  prior  to  the  arrival  of  each  spike.  Since  the  spectral  content  of  the  vocalization  did 
not  cover  the  whole  frequency  range  of  24  kHz  used  in  these  examples  with  the  same 
probability,  we  subtracted  from  this  raw  PESE  th**.  PESE  that  would  emerge  if  the  same 
number  of  spikes  elicited  by  the  stimulus  would  occur  randomly  during  the  stimulus.  This 
results  in  the  spectrally  corrected  PESE.  Figure  lA  and  2A  show  the  PESE  for  the 
vocalizations.  Note  that  the  red  areas  correspond  to  high  probability  values  that  a  signal  of  that 
nature  occurred  before  the  spike,  green  corresponds  to  the  signal  occurrence  probability 
without  causal  reiationshif)  between  signal  and  spike,  and  blue  corresponds  to  a  reduced 
probability,  i.e.  during  those  sections,  a  signal  had  to  be  absent  in  order  for  a  spike  to  occur 
(presumably  reflecting  inhibitory  properties  of  the  receptive  field).  In  Figure  1  A,  the  region  of 
elevated  probability  is  located  between  15  and  20  ms  before  the  spike  occunencc  and  is 
restricted  to  a  fairly  narrow  frequency  range.  This  range  corresponds  well  to  the  timing  and 
spectral  properties  of  the  excitatory  pure  tone  response  (see  Fig.  13  and  ID).  However,  other 
features  of  the  receptive  field  emerge  that  are  not  seen  in  the  pure  tone  response,  among  them 
arc  the  blue  regions  of  inhibition  surrounding  the  excitatory  region  on  botit  sides,  and  a  slant  in 
the  probability  disffibution  indicating  that  certain  frequency  nwdulation  directions  and  speeds 
may  be  optimal  for  this  cell. 


Objective  2:  One  of  the  objectives  of  our  work  on  the  auditory  cortex  is  to  be  able  to  predict  the 
response  to  complex  signals  from  the  leccptivc  field  information.  In  ordci’  to  accomplish  that,  we 
have  to  develop  a  formalized  version  of  a  filter  model  based  on  the  experimentally  obtained 
receptive  field  properties  and  spatial  distributions.  The  applications  of  such  a  formalized  model  arc 
manifold.  Of  special  interest  to  us  is  their  potential  use  as  a  front  end  for  a  biological  based  signal 
classifier,  their  use  as  a  trainer  for  network  models,  and  their  ability  to  predict  spatial  distribution 
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pattern  of  activity  to  complex  signals  which  is  necessary,  for  example,  in  the  interpretation  of 
optical  recordings  from  the  auditory  cortex  or  of  output  patterns  of  selforganizing  networks. 

We  develo]^  a  filter  model  that  was  based  on  the  experimentally  obtained  characteristics  of 
133  cortical  neurons  distributed  over  a  narrow  frequency  range  (center  frequency  5  kHz,  range 
approximately  0.5  octaves)  and  sampled  along  the  entire  extent  of  the  dorsal-ventral  axis  of  AI.  A 
network  of  133X21  filters  was  constructed  that  simulated  an  area  of  approximately  6X4  ram  of 
AI.  The  distribution  of  the  activity  of  the  model  neurons  was  determined  after  the  application  of 
tone  bursts  at  CF  for  20  different  sound  pressure  levels  (50  to  lOOdB). 

The  receptive  fields  of  the  model  neurons  were  realized  as  a  set  of  20  level  dependent 
bandpasses.  The  width  and  the  shape  pf  the  passbands  of  the  neurons  were  first  reconstructed 
from  experimentally  determined  frequency  response  areas  by  using  their  response  threshold,  best 
level,  rate/level  monotonicity,  Q-lOdB,  and  Q-40dB  information.  The  lower  and  upper  limits  of 
the  passband  at  different  sound  pressure  levels  were  calculated  from  CF,  threshold,  Q-lOdB  and 
Q-40dB,  under  the  assumption  that  the  frequency  response  area  are  symmetrical  around  CF. 
Taking  the  logarithmic  scaling  of  the  frequency  axis  into  account,  the  respective  upper  (fu)  and 
lower  (fi)  cutoff  frequency  values  were  calculated  for  IfidB  and  40dB  above  threshold: 

fl  =  CF*(Vl+0.25*Q2  -  {l/(2*Q)})forQ  =  Q-10dBorQ-40dB 
and 

fu  =  CF2/fl. 

Thereafter,  the  frequencies  were  transferred  into  logarithmic  coordinates  and  straight  lines  were 
drawn  between  the  minimum  threshold  value  at  CF  to  the  fu  and  fi  for  Q-lOdB  and  from  there  to 
the  fu  and  fi  for  Q-40dB.  Cutoff  frequencies  above  40dB  were  extrapolated  by  extending  the 
border  of  the  tuning  curve  with  the  same  slope  as  between  the  Q-lOdB  and  Q-40dB  positions. 

The  shape  of  the  passband  at  a  given  level  is  not  flat  but  the  magnitude  is  frequency  o  pendent  In 
order  to  determine  its  shape,  at  first  the  magnitude  at  CF  was  calculated  for  different  sound 
pressure  levels.  Below  threshold  the  magnitude  was  set  to  zero.  From  threshold  to  a  transition 
point  2()dB  above  threshold  the  magnitude  increased  linearly  (dynamic  range),  and  above  the 
transition  point  the  magnitude  changed  according  to  the  nonmonotonicity  value  of  the  neuron.  To 
compare  the  activities  of  different  neurons,  the  magnitude  at  the  transition  point  was  set  to  100. 
After  determination  of  the  magnitude  of  the  passband  at  CF,  the  magnimdes  at  the  other 
frequencies  in  the  passband  was  determined  for  all  sound  pressure  levels  by  drawing  a  line  from 
the  lower  boundary  of  the  passband  to  the  maximum  at  CF  and  from  this  point  to  the  upper 
boundary  of  the  passband.  Since  logarithmic  frequency  coordinates  were  used,  the  passband 
consisted  of  two  parts  of  exponentially  shaped  curves. 

After  the  reconstruction  of  the  frequency  rcsjionse  area,  for  each  sound  pressure  level 
equivalent  bandpass  filters  were  designed.  To  be  able  to  filter  different  signals  with  different 
energies,  the  amplification  of  the  neuron  for  each  sound  sound  pressure  level  was  determined  by 
dividing  the  magnitude  of  the  filter  through  the  amplitude  of  the  sound  pressure  level.  Since  the 
passbands  were  non-flat  we  used  the  modified  Yule-Walker  method  to  design  recursive  digital 
filters  which  pro\ide  an  optimal  least  square  fir  to  the  shape  of  the  filter. 

The  network  was  constmeted  by  oiticring  the  properties  of  the  model  neurons  according  to 
their  position  along  the  isofrequency  line  and  setting  the  CF  of  the  neurons  to  a  fixed  value.  The 
center  row  of  the  network  was  set  to  a  CF  of  5  kHz.  In  order  to  expand  the  network  to  different 
CFs,  the  isofrequency  line  was  duplicated  and  the  passbands  of  all  neurons  in  tlie  line  was  shifted 
by  a  specific  CF  factor  to  cover  a  range  of  neurons  with  CFs  between  1.25  and  20  kHz  (+/-  2 
octaves). 

The  network  vas  stimulated  with  50ms  tone  bursts  at  21  different  frequencie;;  (1.25  to 
20kH2  in  equidistant  logarithmic  scales)  and  20  different  amplitudes  (5  to  100  dB  SPL)  were 
generated.  First  die  p>ower  of  the  signals  were  determined,  and  then  the  signals  weie  bandpass 
filtered  with  the  appropriate  digital  filter.  The  relative  activity  of  each  neuron  was  defined  as  the 
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maximal  amplitude  of  the  filtered  signal  after  cessation  of  the  overshoot  to  the  onset  of  the  input 
signal,  i.e.  in  the  range  5  to  50  ms  after  toe  tone  onset. 

Figures  3,4  and  5  show  the  resulting  activity  distribution  of  the  133X21  model  neurons.  On 
the  x-axis,  neurons  with  different  frequency  response  areas  are  plotted,  representing  the 
isofrequency  domain.  Small  numbers  correspond  to  locations  near  the  dorsal  end  of  AI,  while 
high  numbers  stand  for  neurons  located  near  the  ventral  part  of  AI.  The  neurons  arc 
approximately  equidistant  along  this  axis.  Along  the  y-axis,  neuron  differ  in  their  CP,  such  that  -2 
inicates  neurons  with  CFs  of  1/4  of  the  neurons  at  zero,  i.e.  they  are  2  octaves  below  the  center 
values.  Because  of  the  dimensions  of  cat  AI,  the  length  of  the  y-axis  corresponds  to 
approximately  4  mm.  On  tlie  z-axis,  the  relative  activity  of  the  nxxiel  neurons  at  the  different 
locations  is  plotted  for  three  different  sound  pressure  levels  (lOdB,  45dB  and  90dB)  of  tone  burst 
stimuli  of  5kHz  (corresponding  to  frequency  position  zero).  Note  ^at  at  the  low  sound  pressure 
levels  active  neurons  are  chiefly  found  around  the  y-axis  values  of  zero,  and  that  more  neurons  in 
the  center  of  the  x-axis  are  active.  This  corresponds  to  the  low  threshold  region  found  in  the 
center  of  AI.  At  intermediate  sound  pressure  levels  (45dB,  Figure  4)  almost  all  neurons  with  CF 
=  0  are  active  and  the  range  of  active  neurons  is  broader.  Within  the  isofirequency  domain, 
neurons  at  the  ventral  and  dorsal  end  of  the  network  arc  more  active  than  neurons  at  the  center  due 
to  threshold  and  monotonicity  properties  of  the  neurons.  At  high  sound  pressure  levels  (9()dB, 
Figure  5)  many  neurons  with  different  CF  at  the  ventral  and  dorsal  extremes  of  the  network  are 
activated  while  fewer  neurons  in  the  center  of  the  network  are  active.  This  spread  of  activation  at 
high  levels  corresponds  well  to  the  actual  behavior  of  auditory  cortex. 

Despite  the  simplicity  of  this  formalized  model  of  cortied  neurons,  a  number  of  behaviors  of 
the  activity  distribution  reflect  the  actual  findings  of  cortical  activity  distributions.  However,  a 
number  of  improvements  need  to  be  added  to  the  model  in  order  to  allow  modeling  of  signals 
other  than  pure  tones.  There  is  no  need  for  the  introduction  of  inhibitory  sidebands  for  the 
prediction  of  pure  tone  responses  since  the  pure  tone  receptive  fields  include  all  of  the  necessary 
information.  By  contrast,  Ae  generation  oi  responses  to  broadband  signals  has  to  include  filter 
properties  that  take  into  account  inhibitoiy  interactions. 

Objective  3:  Two  approaches  to  selforganizing  models  of  cortical  activity  or  parameter 
distributions  have  been  initiated  over  the  previous  funding  period. 

Connectivitv  model  (in  collaboration  with  Koichi  Samishimal:  The  mammalian  central  auditory' 
system  is  a  complex  system  with  huge  numbers  of  variables  contributing  to  its  functions.  It  is  of 
common  sense  that  with  adequate  selection  of  parameters  values  is  it  possible  to  generate  any  kind 
of  behavior  from  a  system  with  high  degiees-of-freedom.  With  the  current  limited  computer 
power  resources,  it  is  necessary  to  introduce  some  reasonable  simplifications  into  the  iTKxlel.  The 
goal  of  this  work  is  to  reproduce  the  response  characteristics  and  <tistribution  of  primary  auditory 
cortex  based  on  available  neuroanatomic  connectivity  information  for  the  auditory  cortex  and 
simple  neural  network  cell  model  with  a  Hebbian-like  learning  rule.  It  is  postulated  tliat,  from  this 
starting  point,  a  selforganizing  process  can  recreate  some  of  the  features  experimentally  observed. 
It  is  hoped  that  the  outcome  of  this  approach  will  provide  some  insights  to  and  predictions  of 
auditory  system  physiology  that  might  lead  to  further  experimentally  addressable  questions. 

In  this  modelling  we  are  primarily  interested  in  the  understanding  the  cortical  representation 
of  responses  to  different  dimensions  of  sound,  e.g.  bandwidth  and  intensity,  across  AI.  The 
proposed  neural  network  has  five  layers,  one  cochlear  excitatory  input,  two  subcortical  layers  and 
two  cortical  layers,  as  depicted  in  Figures  6, 7  and  8.  The  first  layer  consists  of  a  one  dimensional 
array  of  15x1  cells  representing  the  cochlear  band-pass  filter  and  hair  cells.  The  subcortex  has  two 
layers,  one  excitatory'  and  one  inhibitory,  of  15x10  cells.  The  cortex  is  represented  by  the  same 
number  of  15x10  excitatoiy  and  inhibitory  cells.  Each  cell  of  the  cochlear  layer  projects  to  a  band 
of  3  by  10  subcortical  excitatory  (SE)  cells;  as  shown  in  Figures  6  and  7  preserving  the 
isofrequency  representation  along  one  dimension  of  subcortical  layer.  Each  SE  cells  sends 
efferents  to  excitatory  cells  (CE)  of  the  cortical  layer  with  the  isofrequency  band  oriented  ventro- 
dorsally  (Figure  6),  with  increasing  spreading  of  connection  in  the  dorsal  direction.  In  other 
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words,  each  SE  cell  projects  to  topographically  equivalent  3x3,  5x5  or  7x7  neighborhood  of  CE 
cells.  On  the  vcntro-dorsal  orientation,  the  first  four  SE  cells  projected  to  3x3  QE  cells,  the  next 
three  cells  to  5x5  and  finally  three  more  cells  to  7x7  cells.  In  order  to  establish  planar  border 
condition,  lower  frequency  edge  of  networic  has  been  made  contiguous  to  the  high  frequency  end, 
and,  due  to  the  increasing  spreading  of  connectivity,  the  network  sUticture  is  not  homogeneous 
along  the  isoffequency  ban^  so  that  the  subcortical  and  cortical  layers,  both  inhibitory  and 
excitatory,  networks  were  doubled  in  a  mirror  image  to  make  the  transition  smooth.  So  the 
simulated  network  has  20  by  15  cells  matrices  in  the  four  subcortical  and  cortical  layers.  In  the 
subcortical  layer,  each  excitatory  cell  projects  to  3x3  neighboring  inhibitory  (SI)  cells  and  in  turn 
each  SI  sends  connection  to  5x5  neighbaiing  SE  cells  (see  Figure  8).  Similarly  in  the  cortex,  each 
CE  cell  projects  to  neighboring  3x3  cortical  inhibitory  (Cl)  and  excitatory  (CE)  cells;  and  each  Cl 
cell  is  connected  to  5x5  CE  cells.  And  finally,  CE  ceUs  project  back  to  topographically  equivalent 
5x5  inhibitory  and  excitatory  subcortical  ccUs  (figure  7). 

We  used  a  similar  neuronal  model  presented  previously  by  Grajski  and  Merzcnich  (1989). 
The  stimulus  pattern,  equivalent  to  monotonic  sound,  was  a  triangular  shap)ed  input  excitation 
pattern  exciting  5  cells  of  the  cochlear  layer,  nic  cell  model  is  the  same  for  inhibitory  and 
excitatory  cells.  The  cells  membrane  potential  is  given  by  following  equation: 

=  • 

^  m  n  ■*  P 

0‘a  =  V- VI'S  „ , 

U‘,M  =  -Tm.U‘^  +  , 

u‘s, = -Tm.  yi + 2  V- tlx  • 

where  U'^  and  respKXtively,  membrane  potential  and  cell  firing  rate,  ranging  from  0  to 

100%,  for  post-synaptic  unit/cell  i  of  layer  X  with  cell  type  Y;  Tm.  is  the  membrane  time  constant; 
M,  N  and  P  are  total  number  of  input  to  the  specific  post-synaptic  cell;  is  the  synaptic 

strength  from  the  pre-synaptic  p-th  cell  of  layer  X  of  cell  type  Y  to  the  post-synaptic  i-th  cell  of 
layer  X'  and  cell  t^e  Y'.  CO  stands  for  cochlear  excitatory  ceU.  The  firing  rate  of  each  cell  is 
given  by  tlie  following  sigmoidal  transformation  of  membrane  potential, 


^  1^(1  +  tanh(4(t/‘  - 1)))  {/■  ^  0.02, 

.0  (7*  <0.02. 

The  synaptic  strength  is  updated  by  a)  Hebbian  activity-dependent  changes  ,  b)  passive  decay, 
that  can  be  expressed  by 

U'«(r)=  lV‘"(r-l)-  x,y.V/‘”{t~\)+aV‘V” 

=  (1  -  x,yn)W^{t  -l)  +  aVV” , 
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is  the  synaptic  strengtn  decay  constant  and  a  the  activity-dependent  synaptic  change  constant 
for  each  integration  step,  and  c)  normalization,  in  which  case  the  sum  of  all  synaptic  strength  to 
cell  i  divided  by  the  number  of  connections  is  kept  constant  to  R=2  for  all  ceUs, 

N‘y 

R  is  called  the  total  resource  of  cells. 

The  numerical  integration  was  carried  out  with  a  backward-Euler  algorithm  using  integration  time- 
steps  of  0.2  unit  time,  iiyiiMl.Ol,  Tm=0.2  and  a  =  0.05. 

The  network  was  trained  without  supervision  by  applying  monotonic  tone  inputs  to  the  cochlea, 
with  triangular-shaped  activation  of  hair  cells,  in  random  frequency  order.  At  the  beginning  of 
each  stimulation  cycle,  the  membrane  potential  was  set  to  zero,  then  stimulus  were  applied  for  two 
unit  time  and,  at  the  end  of  five  unit  times,  synaptic  strengths  were  updated  and  norr^zed.  The 
training  session  was  finished  when  the  synaptic  strength  changes  were  smaller  than  5%  after  10 
stimulation  cycle  of  the  complete  set  of  15  inputs  (number  of  cells  on  frequency  dimension). 

Figure  9  shows  the  outcome  of  the  cciodel  when  stimulated  with  a  pure  tone  of  medium  high 
intensity.  The  two  left  panels  indicate  the  activity  distribution  of  the  excitatory  and  inhibitory 
cortical  layers  and  the  two  panels  on  the  right  illustrate  the  activation  of  the  subcortical  layers. 

Note  that  the  activity  profile  in  the  excitatory  cortical  layers  shows  an  elevated  activity  along  the 
isofr^uency  axis  auci  a  local  minimum  in  the  ventro-dorsal  center  similar  to  observation  in  the 
physiology.  The  rr  ^on  with  a  minimum  excitatory  activity  corresponds  to  a  region  of  high 
inhibitory  activity  in  the  subcortical  layers  Oower  right  panel).  Interestingly,  the  coitical  inhibitoiy 
layer  also  shows  a  minimum  at  the  location  of  reduced  excitatory  cortical  activity  suggesting  that 
the  strength  of  iniiibitioi;  in  the  center  of  AI  may  not  be  the  only  parameter  that  influences  the 
degree  of  nonmonotonicity  of  cortical  rate/levei  functions. 

At  this  early  stage  of  the  model,  several  basic  aspects  need  to  be  studied  and  improved  before 
the  outcome  of  the  rnodelling  can  be  interpreted  with  confidence.  Among  tliem  are  the  following: 
1)  The  influence  of  each  learning  and  normalization  parameter  on  the  belmvior  of  the  network;  2) 
The  use  of  larger  networks  in  both  dimension,  so  that  we  will  be  able  to  assess  with  finer  detail 
cortical  response  representations;  3)  Addition  of  more  anatomical  mxormation  into  tlic  model  while 
keeping  the  general  simplicity  of  the  layout  thus  allowing  reasonable  margins  of  analyzing  factors 
influencing  the  network  behavior,  4)  Training  the  network  with  mere  natural  stimuli  sets. 

Kohonen  model:  Our  experimental  findings  .*.uggest  that  the  global  organization  of  auditory  cortex 
can  be  described  as  the  topographical  representation  of  sevei^  independent  stimulus  domains.  A 
factor  analysis  of  our  data  indicated  at  least  four  independent  parameters,  namely  frequency, 
bandwidth,  intensity,  and  binaural  interaction.  Although  it  is  possible,  that  other  independent 
parameters  arc  also  represented,  e.g.  temporal  properties,  we  have  at  this  time  no  clear  evidence 
regarding  the  natuie  of  those  parameters  or  their  distributior  pattern  in  AI  (but  see  Objective  1). 
Based  on  these  four  identified  feature  dimensions,  a  reduction  in  dimensionality  has  to  take  place 
in  order  to  project  these  four  dimension  onto  the  two  spatial  dimensions  of  au^tory  cortex  studied 
so  far  (all  of  our  investigations  have  ccnccntrated  on  response  proi>crtics  in  layers  mb  and  IV). 
One  class  of  models  that  appjcar  to  be  ideally  suited  is  the  dimensionality-reducing  mapping 
approach  using  Kohonen  algorithms  (e.g.  Kohonen,  1982). 

In  an  attempt  to  produce  evidence  that  the  mapping  several  parameters  onto  the 
isofirequency  domain  of  AI  is  compatible  with  principles  of  topology-preserving  mapping 
emerging  in  Kohonen  networks,  we  designed  a  simple  nctwo^  that  projected  two  dimensions  to 
a  linear  array.  The  two  dimensions  were  selected  to  represent  tlie  bandwidth  feature  dimension 
and  the  intensity  feature  dimension  as  seen  in  cortical  maps  along  the  isofiequcncy  lines.  The 
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linear  array  corresponds  to  neurons  along  the  isofiequency  axis  of  AI.  Two  thousand  randomized 
input  coordinates  of  the  two  training  dimensions  were  selected  wnthout  bias  for  any  parameter 
combinations.  The  output  array  had  40  elements.  During  the  learning  process,  every  unit  of  the 
output  array  (neurons  in  the  isofrequency  domain)  will  become  maximally  sensitiz^  to  a 
particular  location  in  the  two-dimensional  input  array  with  different  units  ^ing  sensitized  in  an 
orderly  fashion.  By  plotting  the  output  array  within  the  original  two-dimensional  input  plane,  the 
generd  mapping  can  be  visualized.  The  left  column  in  Figure  10  shows  five  different  outcomes  of 
such  a  mapping  after  32000  cycles  of  training  (the  patterns  were  already  stable  after  about  5(X)0  to 
1000  cycles).  Note  that  tlie  shape  or  trajectory  of  the  linear  array  acix^ss  the  input  dimensions 
shows  different  pattern  such  as  a  circle,  ar.  3-shape,  or  a  figure  eight  Parameters  such  as  rate  of 
learning,  number  of  inputs,  and  the  number  of  training  cycles,  were  kept  constant.  The  only 
parameter  that  was  changed  was  the  size  of  the  neighborhoo^i  !.>.  was  included  into  the  npdaring 
process.  The  largest  neighboihood  size  was  39  (upper  panel;  and  the  smallest  neighborhood  was 
2  (lowest  panel).  Note  the  increase  in  complexity  of  the  trajectories  for  the  smaller  neighborhood 
sizes. 

In  order  to  allow  a  more  direct  comparisons  with  the  experimental  data,  the  projections  of  the 
trajectory  on  the  two  parameter  axes  is  plotted  in  the  right  column  of  Figure  10.  The  dotted  lines 
show  tlie  change  of  the  bandwidth  factor  along  the  hypothetical  isofiequency  domaii:  at'id  the  solid 
line  shows  the  spatial  variation  of  the  intensity  factor.  Two  aspects  of  these  single  factor 
trajectories  should  be  noted.  First,  the  topology-preserving  mapping  results  in  fairly  smooth  and 
nearly  sinusoidal  variations  of  each  parameter  tli^ugh  its  parameter  space.  However,  the  spatial 
frequency  of  this  parameter  sequence  can  change.  This  is  indicated  by  the  different  spatial 
frequency  estimates  noted  for  each  trajectory.  The  spatial  fiequency  values  for  the  intensity  factor 
(SF(I))  vaiy  between  1  cycle  along  the  isofrequency  domain  (upper  diagram)  to  2  cycles  (lower 
diagram).  In  this  case  the  spatial  fiequency  of  the  bandwidth  factor  (SF(B))  varied  between  0.5 
and  1  cycles.  Second,  despite  the  smooth  and  approximately  sinusoidal  course  of  the  parameter 
trajectories,  essential  all  trajectories  show  some  regions  of  tittle  change  of  one  or  both  of  the  two 
parameters,  resulting  in  regions  with  fairly  similar  parameter  values.  Both  of  these  effects,  fairly 
smooth  general  gradients  of  parameters  along  the  isofrequency  lines  with  spatial  frequencies  of 
the  same  order  of  magnitude  and  small  regions  with  a  constant  parameter  value,  arc  also 
characteristic  for  the  parameter  distribution  actually  observed  in  AI.  Figure  1 1  shows  the  actual 
trajectories  of  bandwidth  and  intensity  factors  along  the  isofiequency  axis  of  five  cats.  Although 
these  real  data  are  more  noisy  than  the  model  maps,  similarities  to  the  model  results  can  be  noted. 
Some  two-dimensional  trajectories  ‘'.how  an  elliptical  shape,  other  resemble  a  figure  eight. 
Accordingly,  the  trajectories  plotted  for  each  parameter  separately  show  different  spati^ 
frequencies.  ITie  actual  spatial  frequencies  are  between  0.5  and  2.5  not  unlike  the  range  emerging 
from  the  model.  In  addition,  the  real  parameter  distributions  show  a  'patchiness',  or  regions  of 
little  change  in  a  given  parameter.  This  patchiness  has  been  used  by  oAers  to  argue  against  the 
existence  of  real  parameter  gradients  in  the  organization  of  the  isofrcquency  domain.  From  the 
modeling  results  it  appears  that  general  gradients  and  local  patches  arc  both  necessary 
consequences  of  an  selforganizing,  dimension-reducing,  topology-preserving  algorithm. 

The  input  distributions  of  the  parameters  to  the  Kohonen  network  covert  all  parameter 
combinations  with  equal  probability.  Tliis  is  not  necessarily  the  case  for  the  real  auditory  system. 
In  fact,  the  course  of  some  of  the  trajectories  (oblique  ellipses)  indicate  that  certain  paranietcr 
combinations  appear  to  be  slightly  overrepresented.  Whether  this  is  a  consequence  of  the  actual 
input  probability  distributions  or  might  be  related  to  the  existence  of  other  parameters  that  are  also 
projected  onto  this  space  and  may  distort  the  representation  of  these  two  dimension  remains  to  be 
seen. 

It  is  concluded  that  selforganizing  Kohonen  networks  arc  in  reasonable  agreement  with 
actually  observed  cortical  parameter  representations  and  that  smooth  pamneter  gradients  with 
occasional  patchiness  is  a  natural  consequence  of  topology  preserving  algorithms. 
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Summary: 

Combined,  the  cortical  experiments  and  modelling  attempts  represent  an  approach  to 
bioacoustic  signal  classification  that  is  based  on  the  biological  transformation  and  parametric 
representation  of  complex  signals  in  the  mammalian  CNS  of  animals  with  'open  reception', 
si^ar  to  humans.  Application  of  similar  principles  to  technical  signal  classification  schemes 
should  result  in  an  improvement  of  the  classification  success  since  the  utilized  mechanisms 
contain  biological  proven  aspects  that  aid  the  animal  in  a  robust  signal  classification.  The  parallel 
electrophysiological  and  modelling  study  of  the  cortical  representation  of  signals  in  noisy  and 
quiet  background  have  revealed  mechanisms  that  are  responsible  for  the  pre-processing  in 
successful  classification  schemes  and  can  enhance  our  u^erstanding  of  ^e  signal  analysis 
properties  and  their  implementation  in  the  CNS. 
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Figure  Legends: 

Figure  I :  Rcvcnc  coneUuiun  of  neuronal  respomes  ui  iiuduar>'  cones  vkith  uiput  signal.  A;  Ptc- 
cveni  stimulus  ensemble  (PESLi  of  a  cortical  neuron  in  response  to  vocalizations  A  tune  micrval  of 
30ms  before  the  gcncrauon  of  an  acoon  potential  is  shown  The  signals  that  proceed  each  spike 
(N>  100)  have  been  averaged  in  1 28  frquency  channels  after  lowpass  filtering  each  channel.  The 
resulting  tcmprual  envelop  average  is  proportional  to  the  probability  that  a  certain  stimulus 
configuration  has  to  oocuf  before  a  spike  can  be  elicited.  Red'  probability  high  above  average;  green: 
average  probability,  blue:  probability  below  average.  B.PESL  for  pure  tone  stimulation.  Same 
convention  as  for  panel  A.  C:  Color  inverted  PLSE  for  vocalizations  (panel  A),  blue:  probability  high 
above  average:  green;  average  probability,  red:  probability  below  average.  D:  I^uency  response 
area  of  neuron.  67S  different  tones  were  used  to  generate  the  intcnsity-fi^uency  response  profile. 
Red:  evoked  response. 

F'igure  2;  Reverse  correlation  from  neuronal  responses  in  auditoiy  cortex. Same  convention  as  in 
Figure  1. 

Figure  3:  Modelled  neuronal  activity  profile  across  auditory  conex.  The  neuronal  activity  (lealdvc 
spike  rate)  is  plotted  as  elevation  The  other  tvo  axis  correspond  to  the  isofrequency  axis, 
presented  by  133  neurons  and  the  frequency  axis  represented  in  octaves.  Low  numbers  along  the 
isofrequency  oxis  correspond  to  the  dorsal  portion  of  AI.  The  stimulus  was  a  pure  tone  with  a 
frequency  corresponding  to  the  center  of  the  frequency  axis  and  an  intensity  of  lOdB. 

Figure  4;  Modelled  neuronal  activity  profile  across  auditory  cortex.  The  stimulus  was  a  pure  tone 
with  a  frequency  corresponding  to  the  center  of  the  frequency  axis  and  an  intensity  of  43d5.  Same 
conventions  as  in  Figure  3. 

Figure  5:  Modelled  neuronal  activity  profile  across  auditory  cortex.  The  stimulus  was  a  pure  tone 
with  a  ft^ucncy  corresponding  to  the  center  of  the  frequency  axis  and  an  intensity  of  45dB.  Same 
conventions  as  in  Figure  3. 

Figure  6:  Connectivity  organization  in  cortical  nxxicl.Thc  first  layer  (cochlear  input)  consists  of  a  one 
dimensional  array  of  1 5x  1  cells  representing  the  cochlear  band-pass  filter  and  hair  cells.  The 
subcortex  has  two  layers,  one  cxcitaiory  and  one  inhibitory,  of  15x10  cells.  The  cortex  is 
represented  by  the  same  number  of  1 5'  10  excitatory  and  inhibitory  cells.  Each  cell  of  the  cochlear 
layer  projects  to  a  band  of  3  by  10  subcortical  cxcitatcwy  (SE)  cells  preserving  the  isofrcquency 
representation  along  one  dimension  of  subcortical  layer.Thc  spread  of  connections  (light  shade) 
away  from  the  isofr^ucncy  axis  is  uniform  in  the  subcortical  layer  and  not  uniform  in  the  cortical 
layer. 

Figure  7:  Each  SE  cell  projects  to  topographically  equivalent  3x3, 5x5  or  7x7  neighborhood  of  CE 
ccUs.  On  the  ventro-dorsal  orientation,  the  first  four  SE  cells  projected  to  3x3  CE  cells,  the  next  three 
cells  to  5x5  and  finally  three  more  cells  to  7x7  cells. 

Figure  8:  In  the  subcortical  layer,  each  excitatory  cell  projects  to  3x3  neighboring  inhibitory  (SI) 
cells  and  in  turn  each  SI  sends  connection  to  5x5  neighboring  SE  cells. 

Figure  9:  Activity  distribution  in  subcortical  and  cortical  layers  when  stimulated  with  tone  of 
moderate  intensity.  The  two  left  panels  indicate  the  activity  distribution  of  the  excitatory  and 
inhibitory  cortical  layers  and  the  two  panels  on  the  right  illustrate  the  activation  of  the  subcortical 
layers. 

Figure  10:  Ci)itical  dimension  reduction  and  spatial  frequency  emergence  based  on  Kohonen 
mapping.  A  two-dimensional  array  was  projected  onto  a  linear  array  using  a  Kohonen  algorithm. 
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The  left  column  shows  the  results  of  five  different  runs  of  the  model.  Each  of  the  linear  array 
neurons  (N=40)  is  plotted  at  the  location  of  the  original  input  space  that  provides  the  strongest  input 
(strongest  weight).  Parameter  is  the  size  of  the  neighborhood  included  in  the  updating  of  the 
weights.  Upper  panel:  Neighborhood  size  =  39.  lowest  panel:  Neighborhood  sizc:2.  In  the  right 
columns,  the  trajcctones  are  plotted  for  each  input  dimension  separately.  Dashed  line:  bandwidth 
dimension;  solid  Unc;  intensity  dimension.  The  spatial  frequency  (SF)  was  obtaiiied  for  each  curve 
by  esdmating  the  lowest  Fourier  component  that  would  describe  each  trajectory. 

Figure  1 1 :  Actual  trajectories  in  bandwidth-intensity  space  for  the  isofircqucncy  domain  (10- 12kHz) 
of  five  animals.  The  actual  trajectories  was  extracted  form  the  data  by  reducing  the  local  parameter 
scatter  through  a  moving  average  window  of  length  10.  Conventions  same  as  in  Figure  10. 
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SUMMARY  AND  CONCLUSIONS 

/ .  The  spatial  distribution  of  the  sharpness  of  tuning  of  si'tgle 
neurons  riong  the  dorsoventral  extent  af  primary  auditory  cortex 
(All  was  studied.  A  sharpness  of  tunitig  gradient  was  initially  ob¬ 
tained  with  multipie-unit  recordings,  and  in  combination  with  the 
cochleotopic  organization,  served  as  a  frame  of  reference  for  the 
locations  of  single  neurons.  The  frequency  selectivity  or  “inte¬ 
grated  excitatory  bandwidth"  of  multiple  units  varied  systemati¬ 
cally  along  the  dorsoventral  extent  of  Al.  The  most  sharply  tuned 
unit  clusters  were  found  at  the  approximate  center  of  the  dorso¬ 
ventral  extent.  A  gradual  broadening  of  the  integrated  excitatory 
bandwidth  in  both  dorsal  and  ventral  directions  was  consistently 
seen. 

2.  The  multiple-unit  measures  of  the  bandwidth  10  (BWIO) 
and  40  dB  ( BW40 1  above  minimum  threshold,  pooled  across  sev¬ 
eral  animals  and  expressed  in  octaves,  were  similar  to  those  de¬ 
scribed  within  individual  cases  in  cats.  As  m  the  individual  ani¬ 
mals.  the  bandwidth  maps  were  V  shaped  with  minima  located  at 
the  approximate  center  of  the  dorsal-ventral  extent  of  Al.  The 
location  of  the  minimum  in  the  multiple-unit  bandwidth  map 
( i.e..  the  most  sharply  tuned  area)  was  used  as  a  reference  point  to 
pool  single-neuron  data  across  animals. 

J.  For  single  neurons,  the  dorsal  half  of  the  BW40  distribution 
showed  a  gradient  paralleling  that  found  for  multiple  units.  For 
both  single  and  multiple  units,  the  average  excitatory  bandwidth 
increased  at  a  rate  of  —0.27  octaves/ mm  from  the  center  of  Al 
toward  the  dorsal  fringe.  Differing  from  the  dorsal  half  of  Al,  the 
ventral  half  of  Al  showed  no  clear  BW40  gradient  for  single  units 
along  its  dorsoventral  extent.  At  40  dB  above  minimum  threshold, 
most  ventral  neurons  encountered  were  sharply  tuned.  By  con¬ 
trast.  the  multiple-unit  BW40  showed  a  gradient  similar  to  the 
dorsal  half  with  0.23  octaves/ mm  increasing  from  the  center  to¬ 
ward  the  ventral  border  of  Al. 

4.  For  single  neurons,  BWIO  showed  no  clear  systematic  spa¬ 
tial  distribution  in  Al.  Neither  the  dorsal  nor  the  ventral  gradient 
was  significantly  different  from  zero  slope,  although  the  dorsal  half 
showed  a  trend  toward  increasing  BWiOs.  Contrasting  single  neu¬ 
rons.  both  dorsal  and  ventral  halves  of  Al  showed  BW 1 0  slopes  for 
multiple  units  confirming  a  V-shaped  map  of  the  integrated  excit¬ 
atory  bandwidth  within  the  doisoventral  extent  of  Al. 

5 .  On  the  basis  of  the  distribution  of  the  integrated  ( multiple- 
unit)  excitatory  bandwidth.  Al  was  parceled  into  three  regions;  the 
dorsal  gradient,  the  ventral  gradient,  and  the  central,  narrowly 
tuned  area.  In  ventral  Al,  single  units  were  significantly  more 
sharply  tuned  than  multiple  units  for  BWIO  and  BW40.  In  dorsal 
Al.  single  units  were  not  sutistically  different  from  multiple  units 
for  BW40.  In  central  Al.  single  units  were  significantly  sharper  for 
BW40,  but  not  BWIO. 

6  Estimates  of  the  scatter  of  characteristic  frequency  (CF)  of 
single  neurons  in  the  dorsoventral  extent  domain  were  obtained 
relative  to  the  frequency  oi-ganization  determined  with  multiple 
units.  The  central  narrowly  tuned  region  showed  the  Irast  CF scat¬ 


ter.  The  CF  scatter  increased  toward  the  dorsal,  and  in  particular, 
the  ventral  end  of  the  dorsoventral  extent  of  .Al. 

7 .  The  combined  single-  and  multiple-unit  results  suggest  that 
Al  is  composed  of  at  least  two  functionally  distinct  subregions 
along  the  dorsoventral  extent/  isofrequency  domain  on  the  basis 
of  the  bandwidth  propenies  of  tuning  curves.  The  dorsal  region 
(Aid)  displays  a  global  gradient  of  BW40  expressed  in  single-  and 
multiple-unit  measurements,  and  contains  broadly  and  sharply 
tuned  single-peaked  as  well  as  multipcaked  neurons.  The  ventral 
region  ( Alv )  predominantly  conuins  neurons  narrowly  tuned  at 
40  dB  above  threshold.  The  increase  in  integrated  ( multiple-unit ) 
excitatory  bandwidth  in  Alv  can  be  related  to  a  progressively 
larger  CF  scatter  at  more  ventral  locations.  The  transition  between 
Aid  and  Alv  is  delineated  by  a  reversal  in  the  BW40  gradient  for 
multiple  units  coinciding  with  a  region  of  small  CF  scatter  and 
sharp  tuning  of  the  underlying  single  neurons. 


INTRODUCTION 

Recent  studies  of  cat  auditory  cortex  have  demonstrated 
topographic  order  within  the  elearophysiologically  deter¬ 
mined  organization  of  the  dorsoventral  extent  of  the  pri¬ 
mary  auditory  cortical  held  (Al)  (Mendelson  et  al.  1988; 
Schreiner  and  Cynader  1984;  Schreiner  and  Mendelson 
1990;  Schreiner  etal.  1988;  Sutter  and  Schreiner  199la)in 
addition  to  previously  described  organizations  relating  to 
binaural  response  properties  ( Imig  and  Adrian  1977;  Imig 
ct  al.  1990;  Middlebrooks  et  al.  1980;  Ritjan  et  al.  1990; 
Reale  and  ECettner  1986).  Several  response  parameters,  in¬ 
cluding  sharpness  of  frequency  and  amplitude  tuning,  ob¬ 
tained  from  multiple-unit  recordings,  were  found  to  be 
notirandomly  distributed  along  the  dorsoventral  dimen¬ 
sion  of  .M.  These  findings  suggest  a  systematic  functional 
organization  in  the  isof^uency  domain  of  Al  that  in¬ 
volves  the  coding  of  basic  signal  properties,  among  them 
signal  intensity  and  spectral  complexity  (Schreiner  and 
Mendelson  1 990;  Shamma  and  Beaman  1990;  Sutter  and 
Schreiner  1991a).  The  functional  interpretation  of  topogra¬ 
phies  based  on  multiple-unit  recordings  is,  however,  lim¬ 
ited  and  can  only  represent  an  approximation  of  actual  cor¬ 
tical  processing.  Ultimately,  the  properties  of  single  neu¬ 
rons  underlying  those  integrated  measures  of  cortical 
activity  must  be  analyzed  and  related  to  multiple-unit  find¬ 
ings  to  better  understand  the  basic  principles  and  spatial 
distribution  of  physiological  characteristics  and  funaional 
organization  of  Al. 

Studying  the  spatial  distribution  of  single-neuron .  re¬ 
sponse  properties  across  large  portions  of  Al  has  proven  to 
be  a  difficult  task  for  a  number  of  reasons.  Historically. 
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single-unii  studies  that  used  anatomically  based  pooling 
strategies  ( c.g..  Evans  and  Whitlield  1964:  Goldstein  ei  al. 
1970 )  failed  to  fully  support  basic  organizational  principles 
such  as  a  strong  cochleotopicity  of  Al  as  obtained  with  inte¬ 
grated  responses.  For  example,  using  sulcal  patterns  and 
vasculature  as  landmarks  for  pooling  topographical  data 
across  animals  is  untenable  because  Ixith  display  a  large 
variability  among  animals  (  Merzctiich  ct  al.  1975 1.  .Addi- 
tionallv.  the  cyioarchitectonic  boundaries  of  .Al  are  not  of 
sufficient  sharpness  (e.g..  Rose  1949;  Winer  1984)  to  allow 
a  precise  alignment  of  locations  vviihin  a  given  field  from 
different  animals.  Characterizing  a  sufficient  number  of 
single  neurons  in  one  experiment  to  construct  a  reliable 
physiological  map  is  difficult  in  a  cortical  field  of  the  extent 
of  cat  .A I. 

An  alternative  to  utilizing  anatomic  landmarks  is  the  use 
of  known  systematic  spatial  distributions  of  physiological 
features  that  may  provide  information  for  the  proper  spa¬ 
tial  alignment  of  fields  from  several  individual  cortices.  The 
most  basic  choice  is  the  use  oT  the  cochleotopic  ftwiucncy 
gradient,  reliably  determinable  with  multiple-unit  record¬ 
ings  le.g.,  Merzenich  et  al.  197S;  Reale  and  Imig  1980; 
Schreiner  and  Mendelson  1990),  to  determine  the  rostro- 
caudal  extent  of  the  field  and  to  identify  locations  within 
this  dimension.  However,  a  second  parameter  is  necessary 
to  align  locations  along  the  dorsoventral  extent  of  Al.  i.e.. 
approximately  along  its  isofrequency  domain.  The  clus¬ 
tered  distribution  of  binaural  response  charaacristics  along 
the  dorsoventral  extent  of  Al  is  quite  variable  from  animal 
to  animal  ( Imig  and  Adrian  1977;  Imig  and  Brugge  1978; 
Irnigand  Reale  1981;  Middiebrooks  et  al.  1980;  Rnaleand 
Kettner  1986;  Schreiner  and  Cynader  1984)  and.  therefore, 
seems  not  to  be  suitable  as  a  basis  for  a  spatial  normaliza¬ 
tion  or  alignment  of  Al  from  difterent  cortices. 

A  second  physiological  parameter  that  may  be  particu¬ 
larly  useful  in  aligning  cortical  fields  is  the  spatial  distribu¬ 
tion  of  the  integrative  cxciutory  bandwidth  along  the  dor¬ 
soventral  dimension  of  Al.  A  recent  study  (Schreiner  and 
Mendelson  1990)  revealed  a  region  ofshaip  frequency  tun¬ 
ing  of  multiple-unit  resFwnses  within  the  central  portion  of 
the  isofrequency  domain  contiguous  with  ventral  and  dor¬ 
sal  gradients  of  decreasing  frequency  seieaivity.  This  spa¬ 
tial  distribution  of  the  integrated  exciutory  bandwidth  in 
Al  ( Schreiner  and  Cynader  1984;  Schreiner  and  Mendelson 
1990)  appears  to  be  /)  similar  and  less  variable  from  ani¬ 
mal  to  animal  and  2)  appears  to  be  consistent  over  a  wide 
range  of  frequencies.  Therefore  it  may  serve  as  a  topo¬ 
graphical  frame  of  reference  for  cempanng  and  pooling 
data  from  different  animals.  In  a  previous  report  this  ap¬ 
proach  was  successfully  utilized  to  determine  the  spati^ 
distribution  of  a  subpopulation  of  auditory  conical  neu¬ 
rons.  namely  neurons  with  multipeaked  tuning  curves,  in 
the  dorsoventral  dimension  of  Al  (Sutter  and  Schreiner 
1991a).  By  pooling  data  across  animals,  a  sufficient  num¬ 
ber  of  topographically  identifiable  neurons  with  multi¬ 
peaked  tuning  curves  was  obtained  to  derive  a  statistically 
!.':cured  estimation  that  their  locations  were  essentially  con¬ 
fined  to  the  dorsal  part  of  Al.  This  paper  explores  the  prop¬ 
erty  and  topography  of  frequency  seieaivity  of  single- 
neuron  responses  along  the  dorsoventral  extent,  approxi¬ 
mating  the  isofrequency  axis  of  cat  Al.  Single-neuron 


positions  were  obtained  relative  to  gradients  in  integrated 
excitatory  bandwidth  determined  with  multiple-unit  re¬ 
cordings.  Some  results  have  previously  been  presented  in 
abstract  form  (Sutter  and  Schreiner  199 lb). 

METHOD 

Surjik'al  preparation 

The  methods  are  similar  to  those  described  in  Sutter  and 
Schreiner  ( I99la>.  Experimenu  were  conduaed  on  nine  young 
adult  cats.  Anesthesia  was  induced  with  an  intramuscular  injec¬ 
tion  of  ketamine  hydrochloride!  10  mg/ kg)  and  acetylpromazine 
maleatelO.28  mg/kg).  Alter  venous  cannulation  an  initial  dose  of 
pentobarbital  sodium  (to  effect.  ~30  mg/kg)  v^as  administered. 
Animals  were  mainuiited  at  a  surgic.'J  level  of  anesthesia  vvith  a 
continuous  infusion  ufpentobarbital  sodium  (2  mg -kg  -h  )m 
Uctated  Ringersolution(  infusion  volume.  3.5  ml/h  land,  if  neces¬ 
sary.  with  supplementary  intravenous  injeaions  of  pentobarbital 
sodium.  The  cats  were  aiso  given  dexamethosone  sodium  phos¬ 
phate  (0. 14  mg/ kg  im)  to  prevent  brain  edema,  and  atropine  sul¬ 
fate  ( I  mg  im)  to  reduce  s^ivati'vn.  The  temperature  of  the  ani¬ 
mals  was  recorded  with  a  recta)  temperature  probe  and  main- 
uined  at  37.S*C  by  means  of  a  heated  water  blanket  with  feedback 
control. 

The  head  was  fixed,  leaving  the  external  meati  unobstructed. 
The  temporal  muscle  over  the  right  hemisphere  was  then  retracted 
and  the  lateral  cortex  cx)XMcd  by  a  craniotomy.  The  dura  overlay¬ 
ing  the  middle  ectosylvian  gyms  was  removed,  the  cortex  was 
covered  with  silicone  oil.  and  a  photograph  of  the  surface  vascula¬ 
ture  was  taken  to  record  tfie  electrode  penetration  sites.  For  record¬ 
ing  topjgrapliically  identified  single  neurons,  a  wire  mesh  was 
placed  over  the  craniotomy,  and  the  space  between  the  grid  and 
cortex  was  filled  with  a  1  %  solution  of  clear  agarose.  This  approach 
diminished  pulsations  of  the  cortex  and  provided  a  fairly  unob¬ 
structed  view  of  identifiable  loations  across  the  exposed  conical 
surface. 

Stimulus  generation  and  delivery 

Experiments  were  conducted  in  a  double-walled  sound-shielded 
room  ( lAC).  Auditory  stimuli  were  presented  via  a  sound  delivery 
system  designed  to  provide  essentially  flat  transfer  funaion  when 
conneaed  to  the  average  cat  ear  (Sokolich  1981.  US  Patent 
4251686).  Headphones  (STAX  54)  enclosed  in  small  chambers 
were  connected  to  sound  delivery  tubes  sealed  into  the  acoustic 
meati.  The  sound  delivery  system  was  calibrated  in  vitro  with  a 
sound  level  meter  ( BrUel  Si  Kjaer  2209)  and  a  waveform  analyzer 
((jcneral  Radio  1521-B).  The  frequency  response  of  the  system 
was  essentially  flat  up  to  14  kHz  and  did  not  have  major  reso¬ 
nances  deviating  more  than  ±6  dB  from  the  average  level.  Above 
14  kHz.  the  output  rolled  off  at  a  rate  of  10  dB/octave.  However, 
the  aaual  in  vivo  transfer  function  may  have  deviated  from  this 
estimate.  Harmonic  distortion  was  below  0.18%  ( -35  dB)  for  the 
maximum  input  voltage  to  the  speakers  of  0.35  Frms-  No  influ¬ 
ences  of  harmonic  distortion  on  the  obtained  tuning  curves  was 
ever  noted. 

Tones  were  generated  by  a  microprocessor  (TMS320 10:  16-bit 
D/A  converter  at  120  kHz  sampling  rate;  low-pass  filter  of  96 
dB/octave  ai  15. 35.  or  50  kHz).  The  amplitude  of  the  ^nals  was 
controlled  by  varying  the  number  of  amplitudes  steps  in  generat¬ 
ing  the  waveform  (each  step  corresponded  to  —0.15  mV).  The 
highest  number  of  amplitude  steps  was  63.246  and  the  lowest  num¬ 
ber  was  20.  resulting  in  a  useful  dynamic  range  of  70  dD.  Addi¬ 
tional  attenuation  was  provided  by  a  pair  of  passive  attenuators 
( Hewlett  Packard ) .  The  duration  of  the  tone  bursts  was  usually  50 
ms  (tone  bursts  were  extended  to  85  ms  for  long--aiency  re- 
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sponscs  I  including  ?-m5  nse/tall  lime,  The  interstimulus  intenal 
v^as  400-1.000  ms, 

Frequency  response  areas  ( FR.isj 

FRAs  \^erc  obtained  tor  each  neuron.  To  generate  an  FR.A.  at 
least  675  different  tone  bursts  uere  delivered.  Tone  bursts  were 
presented  m  a  pseudorandom  sequence  of  different  frequency/ 
level  comomations  selected  from  1 5  level  values  and  43  frequency 
values.  Steps  between  levels  were  5  dF  resulting  in  a  sampled 
dynamic  range  of  70  dB. 

The  frequency  range  covered  by  the  45  frequency  steps  was 
centered  around  the  estimated  characteristic  frequency  ( CF )  of 
the  recording  site  and  covered  between  2  and  5  octaves,  depending 
on  the  width  of  the  frequency  tuning  curve  as  obtained  by  audiovi¬ 
sual  criteria.  Stimulus  frequencies  were  chosen  so  that  the  43  pre¬ 
sented  frequencies  were  spaced  an  equal  fraction  of  an  ocuveover 
the  entire  range  ( for  most  cases  this  provided  a  0.067-octave  reso¬ 
lution  over  a  total  of  3  octaves ) . 

Recording  procedure 

Parylene-coated  tungsten  microelectrodes  (Microprobe)  with 
imp^nces  of  1 .0-8.5  MO  at  I  kHz  were  introduced  into  the 
auditory  cortex  with  a  hydraulic  microdrive  ( fCopf)  remotely  con¬ 
trolled  by  a  stepping  motor.  All  penetrations  were  essentially  or¬ 
thogonal  to  the  brain  surface.  The  recordings  reported  here  vx^ere 
derived  at  intraconical  depths  ranging  from  6(X)  to  1.000  nm  as 
determined  by  the  microdrive  setting,  corresponding  to  portions 
of  cortical  layers  3  and  4.  Neuronal  activity  of  single  neurons,  and 
for  the  initial  mapping,  small  groups  of  neurons  were  amplified, 
band-pass  filteted.  and  monitored  on  an  oscilloscope  and  an  audio 
monitor.  Aaion  potentials  were  isolated  from  the  background 
noise  with  a  window  discriminator  ( BAK  DIS- 1 ).  The  number  of 
spikes  per  presentation  and  the  arrival  time  of  the  first  spike  alkr 
the  onset  of  the  stimulus  were  recorded  and  stored  in  a  computer 
( DEC  11/73).  The  recording  window  had  a  duration  of  50-85 
ms.  corresponding  to  the  stimulus  duration  and  excluding  any 
offset  responses. 

Data  analysis 

From  the  responses  to  673  different  frequency /level  combina¬ 
tions.  an  objectively  deteimined  FRA  was  constructed  for  every 
neuron  or  group  of  neurons.  Figure  I  shows  an  example  of  three 
reconstructed  FRAs  obtained  in  Al.  The  ordinate  conespondsto 
the  sound  level  of  the  tone-burst  stimulus,  whereas  the  abscissa 
corresponds  to  the  fiequeticy.  All  presented  stimuli  would  be  repre¬ 
sented  by  a  1 5  (ordinate)  by  45  (abscissa)  grid  with  equal  spacing 
and  size  of  elemenu.  Responses  are  iepftacnicd  at  the  point  of 
intersection  of  the  intensity  and  frequency  of  each  presented  stimu¬ 
lus  by  a  small  filled  rcctan^.  The  vertical  length  of  each  rectangle 
IS  proportional  to  the  number  of  spikes  discharged  in  response  to 
the  stimulus.  The  absence  of  a  rtcumgle  for  a  given  point  of  stimu¬ 
lus  presenution  represents  no  response.  Usually  each  stimulus 
was  presented  once.  If  the  resulting  FRA  was  not  well  defined,  the 
process  was  repeated  with  the  same  675  stimuli,  and  the  resulting 
evoked  activity  was  added  to  the  first  set  of  responses. 

.A  tuning  curve  was  extracted  from  the  FRA  by  the  use  of  an 
objeaive  method.  For  the  response  threshold,  a  computer  pro¬ 
gram  defined  the  isoresponse  criteria  as  ihe  spontaneous  rate  (esti¬ 
mated  from  45  points  outside  the  response  area )  plus  20%  of  the 
peak  rate  (9-potnt  weighted  averaging).  Weighted  9'point 
smoothing/ averaging  was  used  to  determine  the  response  at  each 
point  because  it  provided  a  more  reliable  estimate  by  increasing 
the  effective  number  of  repetitions  per  location  at  the  expense  of 
some  frequency  and  intensity  resolution  (Sutter  and  Schreiner 


Fraquancy  (kHz) 

no.  1 .  Frequency  response  areas  ( FRAs)  for  3  neurons  in  Al.  A  total 
of  675  tone  bursts  widt  different  frequency /level  combinations  wen  pre¬ 
sented.  Frequency/ level  combinations  marked  with  filled  rectangles  elic¬ 
ited  action  potentials.  Height  of  each  rectangle  corresponds  to  number  of 
action  potentials.  FRAs  are  plooed  for  a  fitqiiency  range  of  4  octaves. 
Bandwidth  values  were  obtained  10  (open  arrows)  and  40  dB  (filled 
arrows)  above  minimum  thtesltold.  .4:  characteristic  frequency  ((TF)  ■ 
9.03  kHz.  BWlO  -  0.13  ocuvcs(Q-IOdR  -  1 1.3).  BW40  -  0.26  octaves 
((}-40<iB  -  5.66).  fl:  (T  -  5  kHz.  BWIO  «  0.55  ocuvei( 2.38),  BW40 - 
0.80  ocuves  ( 1.79).  C:  CF  -  9.7  kHz.  BWIO  -  1.44  octaves  (0.70). 
BW40  -  2. 12  ocuves (0.49). 

1991a).  This  criterion  was  robust,  yielding  comparable  tuning 
curves  for  the  wide  range  of  FRAs  recorded,  including  high  sponu- 
neous  multiple-tmit  and  low  spontaneous  single-unit  recordings. 
Shaded  areas  in  Ftg.  I  corre^nd  to  the  objectively  determined 
FRAs;  the  border  of  the  shading  corresponds  to  the  tuning  curve. 
Sevei’al  response  propeities  were  measuted  from  each  FRA. 
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Among  tliom  v^ere  / )  the  eharactcnstic  frequence  i  CF;  the  stimu¬ 
lus  trequencs  «.ith  the  lowest  sound  pressure  level  neeessarv  to 
evoke  neuronal  activitv  i: I  BW  10 1  the  bandwidth  of  the  FR.A.  in 
octaves.  I(i  dB  above  minimum  threshold),  and  J)  BW40 
(the  bandwidth  of  the  FR.A.  in  octaves.  40  dB  above  minimum 
threshold  I. 

For  units  vvuh  multipeakcd  tuning  curves,  the  entire  response 
area  including  the  nonresponsive  area  between  individual  peaks 
was  used  lo  determine  the  cxciiatorv  bandwidth. 

R  E  S  L  L  r  S 

Results  are  based  on  a  total  of  146  multiple  units  and  103 
single  neurons  from  9  cats.  On  the  average,  multiple-unit 
maps  were  denved  from  15  fairly  evenly  spaced  record¬ 
ing  locations  over  a  restricted  CF  range  (usually  <2  oc¬ 
taves).  From  the  multiple-unit  recordings,  three  parame¬ 
ters  were  extracted.  First,  the  A.1/A11  border  was  deter¬ 
mined  in  accordance  with  critcna  suggested  by  Schreiner 
and  Cynader  ( 1984).  The  criteria  for  All  arc  based  on  a 
reliable  and  spatially  consistent  shift  of  three  multiple-unit 
physiological  properties;  /)  an  increase  in  thresholds  by 
a  15  dB.  2)  Q-lOdB  values  (CF/ excitatory  bandwidth  10 
dB  above  minimum  threshold)  below  a  certain  CF-depen- 
dent  value,  and  i)  a  blurring  of  the  CF  topography.  Single 
neurons  falling  into  All  were  not  included  in  the  analyzed 
set  of  data.  Second,  the  approximate  orientation  of  the  iso¬ 
frequency  contours  within  AI  was  determined,  and  third, 
the  distribution  of  the  integrated  excitatory  bandwidth  10 
and  40  dB  above  minimum  threshold  was  reconstructed. 

A  large  range  of  bandwidths  for  multiple  and  single  units 
were  encountered  in  this  series  of  experiments  correspond¬ 
ing  to  ranges  seen  for  ^lOdB  and  <5-40dB  for  multiple 
units  in  previous  experiments  (Schreiner  and  Mendelson 
1 990 ) .  as  well  as  for  Q- 1  OdB  ( Phillips  and  Irvine  1 98 1 )  and 
bandwidth  values  (Abeles  and  Goldstein  t970a.b;  Hind 
1 960 )  for  single  neurons.  Figure  1  displays  typical  examples 
of  FRAs  with  sharp,  medium,  and  broad  frequency  tuning 
of  single-neuron  responses.  Bandwidth  measures  BWIO 
and  BW40  for  the  units  are  indicated. 

Topography  of  integrated  excitatory  bandwidth 

To  characterize  topographically  the  dorso ventral  axis  of 
AI.  electrode  penetrations  orthogonal  to  the  cortical  surface 
were  placed,  in  most  cases,  in  close  approximation  to  the 
orientation  of  isofrcquency  contours.  The  actual  CFs  used 
to  create  a  map  are  illustrated  for  two  examples  in  Fig.  S. 
For  most  cases,  isoiiequency  contours  in  AI  could  be  ap¬ 
proximated  with  a  straight  line  deviating  slightly  from  the 
dorsal-ventral  direction  (see  Fig.  5C,  the  isofrequency  axis 
is  roughly  parallel  to  the  abscissa).  Occasionally,  the  angle 
between  the  orientation  of  the  approximated  course  of  an 
isofrequency  c  ontour  deviated  as  much  as  30°  from  the 
dorsovcntral  iixis.  The  CF  topography  became  weaker  at 
the  most  dorsal  extent  of  the  ectosylvian  gyrus  where  isofre¬ 
quency  contours  often  slanted  posteriorly  ( sec  Fig.  5.  A  and 
B)  { Middlebrooks  and  Zook  1983;  Sutter  and  .Schreiner 
1 99 1  a ) .  Cortical  recording  locations  are  represented  as  dor- 
soventral  distance  along  the  approximated  orientation  of 
the  straight  portion  of  isofrequency  contours.  The  range  of 
CFs  included  for  each  case  varied  between  0.2  and  3  oc- 
uves.  Essentially  all  locations  included  in  the  current  data 


evaluation  had  CFs  >4  kHz.  In  contrast  to  most  previous 
studies,  the  sharpness  of  frequency  tuning  in  this  study  w  as 
measured  as  bandwidth  expressed  in  octaves.  Figure  2  dis¬ 
plays  BWlO  and  BW40  for  all  single  neurons  in  this  study. 
Similar  to  the  traditionally  used  Q-factor  ( CF/bandwidth ) . 
the  bandwidth  measure  reduces  the  influence  of  absolute 
frequency  on  this  tuning  curve  characteristic  and  has  the 
additional  advantage  of  being  independent  from  the  loca¬ 
tion  of  the  CF  within  the  FRA.  i.e..  it  is  not  influenced  by 
the  symmetry  of  the  FR.A  or  tuning  curve.  For  single  units, 
the  correlation  between  bandwidth  and  CF  was  statistically 
not  significant  over  the  limited  frequency  range  sampled  in 
this  study. 

The  spatial  distribution  of  the  integrated  excitatory  band- 
widths  for  pure  tones  obuined  in  this  study  was  consistent 
with  that  of  previous  reports  (Schreiner  and  Mendelson 
1990:  Sutter  and  Schreiner  1991a).  Locations  with  sharply 
tuned  multiple-unit  responses  yielding  a  narrow  integrated 
excitatory  bandwidth  were  found  in  a  region  near  the  dor- 
soventral  center  of  the  straight  segment  of  isofrcquency 
contours  l.S-3  mm  dorsal  to  the  Al/ All  border.  Locations 
dorsal  and  ventral  to  this  point  gradually  showed  more 
broadly  tuned  responses. 

Figure  3  illustrate  multiple-unit  bandwidth  maps  ob¬ 
tained  10  and  40  dB  above  minimum  reponse  threshold 
for  two  individual  cases.  A  and  B  show  the  reconstructed 
BW40  maps.  Circle  connected  by  dotted  line  show  the 
aaual  bandwidth  value.  These  and  other  case  show  a  dis¬ 
tinct  bandwidth  minimum  surrounded  by  areas  with 
broader  frequency  tuning.  Case  SUTC16  iB  and  D) 
showed  a  larger  variability  in  shyness  of  tuning  in  areas 
that  contained,  on  the  average,  wider  bandwidths.  The  dor¬ 
so  ventral  progression  of  bandwidth  change  was  much 


FIG.  2.  Distribution  of  BWaOsnd  BWIO  uafunciion  of  characteristic 
frecuency  for  all  single  neurons. 


TOPOGRxPH'i  OF  EXCITATORN  BANDVUDTH  IN  Al 


U'Jl 


FIG.  Distribution  ofmuiiiple'unuBW  to  and  BW40  values  along  the  dorsal-ventral  axis  of  AI  for  2  cortexes..'!  end  B: 

BW40  values  (circles)  are  connected  by  dashed  lines.  Solid  lines  represent  smoothed  bandwidth  distnbution.  A  spatial 
weigltting  algonthm  was  used  for  smoothing  aaoss  SOO-um  seaors:  100%  smoothing  for  poinu  at  the  same  location  to  no 
weighting  for  points  SOO  um  apan.  Arrows  mark  the  locations  with  the  minimum  average  BW40  (  BW40m,.).  C  and  f>: 
B w  1 0  values  ( circles )  and  smoothed  bandwidth  distribution.  Arrows  mark  BW 1 0.,. . 


smoother  in  case  SUTC12  (.-1  and  C).  To  aid  in  demarcat¬ 
ing  the  minima  of  bandwidth  plots,  a  500-stm.  spatially 
weighted  smoothing  algorithm  was  used  ( 100%  smoothing 
for  points  at  the  same  cortical  distance  to  no  weighting  for 
points  >500  ^ra  apart).  The  smoothed  version  of  the 
BW40  maps  are  shown  as  solid  lines  in  Figs.  3,  A  and  5, 
respectively.  Arrows  point  to  the  location  of  the  main  mini¬ 
mum  in  each  bandwidth  function.  The  data  points  and  the 
smoothed  version  of  the  BW  10  map  for  these  two  cases  are 
shown  in  Fig.  3.  Cand  D.  in  both  cases  the  BWIO  minima 
are  located  0.1-1  mm  dorsal  to  the  BW40  minimum  in 
agreement  with  previous  findings  for  Q-lOdB  and  Q-40dB 
(Schreiner  and  Mendelson  1990).  The  minima  of  these 
functions  ( BWlOn„„  and  BW40„,„)  served  as  physiological 
reference  points  to  spatially  pool  excitatory  bandwidth  data 
across  animals. 

Single-neuron  maps 

The  spatial  distribution  of  excitatory  bandwidths  for  sin¬ 
gle  neurons  and  multiple  units  is  shown  in  Fig.  4  for  case 
SUTC16.  For  this  animal  a  sufficient  number  («  -  27)  of 
single  neurons  were  sampled  to  directly  compare  single- 
and  multiple-unit  topographies  within  the  same  animal. 
The  origin  of  the  abscissa  designates  the  location  of  the 
combined  bandwidth  mini  num.  that  is.  the  averaged  loca¬ 
tion  of  the  BW  1 0  and  BW40  minima.  For  BW40  ( Fig.  4  A). 
the  spatial  distribution  of  the  single-neuron  sharpness  was 


quite  similar  to  the  integrative  excitatory  bandwidth.  Neu¬ 
rons  with  the  narrowest  bandwidth  were  located  at  or  near 
the  multiple-unit  bandwidth  minimum.  Dorsal  to  the  mini¬ 
mum.  the  bandwidth  for  single  neurons  increased  ( linear 
regression  analysis,  r  -  0.84,  P  -  0.02 )  and  showed  a  scatter 
similar  to  that  expressed  in  the  multiple-unit  responses  ( r  <■ 
0.95.  P  -  0.00 1 ).  Vcntrally,  the  excitatory  bandwidth  of 
single  neurons  also  increased  progressively  (r  «  0.57.  P  - 
0.02),  although  not  as  much  as  seen  for  the  distribution  of 
multiple  units  (r  -  0.92.  P  -  O.OOl).  Sharply  tuned  neu¬ 
rons  were  found  throughout  the  ventral  region  of  AI.  and. 
overall,  the  range  and  scatter  of  the  intej^ted  excitatory 
bandwidth  appeared  to  be  larger  than  for  the  single-neuron 
bandwidth.  The  distribution  of  BWIO  values  for  single  and 
multiple  units  (Fig.  4B)  was  also  fairly  similar.  In  this  par¬ 
ticular  case  the  minimum  of  the  bandwidth  distribution 
was  not  very  strongly  expressed  for  either  multiple  ( see  Fig. 
3D)  or  sin^e  neurons.  However,  the  apparent  scatter  of 
encountered  bandwidths  appeared  to  be  the  smallest  near 
the  estimated  center  of  the  bandwidth  distribution.  The 
correlation  between  location  and  BW  10  was  statistically  sig¬ 
nificant  only  for  multiple  units  in  the  dorsal  part  of  AI  ( r  - 
0.8 1,  P  "  O.OOl ).  The  CFs  of  the  single-  and  multipie-unit 
neurons  are  shown  in  Fig.  5.  In  this  particular  case,  the  CFs 
covered  a  range  of  ~  I  octave.  Although  the  multiple-unit 
distribution  shows  a  clear  correlation  between  dorsoventral 
distance  and  CF  (dorsal  half;  r  -  0  65.  P  ■  0.0 1 :  ventral 
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Fig.  6.  C  and  D.  Finally,  the  locations  of  single  neurons 
were  pooled  across  both  aligned  maps  as  displayed  in  Fig. 
6  £■.  No  distance  normaluation  or  distortion  of  the  indi\  id- 
ual  recording  locations  is  involved  in  this  method,  and.  ac¬ 
cordingly.  the  pooled  data  reflect  the  trends  discernible  in 
each  individual  case.  This  procedure  was  used  to  pool  data 
across  all  nine  cases. 

fiWVO  Topography 

Scattergrams  of  BW40  values  for  all  pooled  single  and 
multiple  units  are  shown  in  Fig.  7.  and  B.  Cortical  loca¬ 
tions  were  aligned  relative  to  BW40n„n  for  each  individual 
case  as  described  above.  For  multiple-unit  responses,  the 
pooled  BW40  data  show  the  same  pattern  seen  in  individ¬ 
ual  cases  in  this  and  previous  studies  ( Schreiner  and  Men- 
dclson  1990:  Sutter  and  Schreiner  1991a).  In  particular,  a 
region  of  sharp  tuning,  located  at  the  approximate  dorso- 
vcntral  center  of  Al.  was  aligned  with  integrated  e.\ciutor> 
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FIG.  4.  Bandwidth  distributiun  along  approximated  isofrequency  do¬ 
main  of  AI  (cau  SUTC16 ).  BW40  (.4)  and  BWIO  (£)  I'or  lingle-neuron 
( -I- )  and  multiple-unit  (o )  are  plotted  at  a  function  of  cortical  distance. 
Zero  millimeter  of  the  cortical  distance  axis  corresponds  to  the  location  of 
BW40„,.. 


half;  r  ■■  0.83.  P  ■  0.00 1 ),  no  significant  correlation  be¬ 
tween  CF  and  location  or  bandwidth  was  seen  for  single 
neurons  in  either  half  of  AI  (see  discussion). 

Pooling  relative  to  multiple-unit  maps 

The  reconstruction  of  the  spatial  bandwidth  distribution 
for  case  SUTC16  was  made  possible  by  sampling  a  rela¬ 
tively  large  number  of  single  ( 27 )  and  multiple  ( 28 )  units  in 
the  same  experiment.  This,  however,  was  an  exceptional 
case,  and  therefore  a  method  of  pooling  data  across  animals 
was  necessary  to  generally  assess  the  topography  of  single- 
neuron  characteristics. 

In  this  study  the  location  of  single  neurons  within  an 
experiment  was  referenced  relative  to  the  multipie-unit 
bandwidth  map.  The  alignment  and  pooling  proc^ure  is 
illustrated  in  Fig.  6  for  the  two  cases  shown  in  Figs.  3  and  4. 
To  aid  in  visualmng  this  procedure,  single  neurons  from 
case  SUTC12  are  marked  by  filled  diamonds,  and  single 
neurons  from  case  SUTC16  are  marked  by  crosses.  The 
unsmoothed  BW40  maps  for  cases  SUTC12  and  SUTC16 
( from  Fig.  3.  ,^1  and  B)  are  represented  by  open  circles  con- 
neaed  by  lines  in  Fig.  6,  A  and  B.  For  these  two  experi¬ 
ments  the  dorsal-ventral  extent  of  AI  was  *«6.2  and  S.3 
mm,  rcspcaively.  To  pool  data  across  these  two  experi¬ 
ments.  the  minimum  in  the  smoothed  BW40  distribution 
(BW40„,„  marked  by  arrows)  was  assigned  the  reference 
value  of  0  mm,  and  the  positions  of  all  recording  points 
were  expressed  as  distance  in  millimeters  from  BW40„,„. 
Next,  the  BW40„,„  of  all  cases  was  aligned.  The  shifted  and 
aligned  recording  locations  of  the  two  cases  are  shown  in 


1 1.0 
et 


V 

’  *  Z7  •  •  I*  1  «11  7 

us 


2 

SUTCW 


M 

^“.73 


1.1  S.t70*l.3 

».2 
7,3  • 

7.3  10 : 


2 

C90-185 


llJ  A , 

9  194 


UJ.V  17,7  >»-S 
*  15;',77 
i7.l  l»J 


-2-10122 
Dorsal-Ventral  Dlctance  (mm) 

FIG.  3.  Conical  locatioiu  of  chancteriMic  frequencies  (CFx  I  for  multi¬ 
ple  and  single  units, ,{ and  B :  locauooi  of  mutuplc-  and  single-unit  record¬ 
ings.  respectively,  for  case  SUTC 1 6.  CFs  ( in  kHz )  are  indicated  next  to  the 
location  marker.  C;  both  single-  and  multiple-unit  recording  locations  for 
case  C90- 1 83.  Multipie-unit  CFs  are  indicated  above  the  location  marker, 
and  siiigie-unit  CFs  are  indicated  below  the  markers  by  italic  numbers.  At 
some  locauons.  several  single  units  were  isolated.  Note  that  the  scale  of  the 
caudal-rostral  distance  is  expanded  compared  with  the  dorsal-ventral  dis¬ 
tance. 
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Kia.  6.  Demonjtnition  of  meihod  used  (0  pool  dau  across  animals.  A  and  C:  tranifonmation  Tor  case  SUTC 1 2.  Band- 
widths  of  single  neurons  40  dB  above  minimum  threshold  (diamonds )  are  plotted  on  same  location  axis  as  the  multiple-unit 
BW40  map  ( connected  circles ) .  In  .I ,  the  most  dorsal  locsoiun  is  arbitrarily  assigned  the  value  0  mm.  In  C.  BW40„,„  ii 
assigned  a  value  of  0  mm.  and  all  other  locations  arc  referenced  as  millimeten  dorsal  ( negative )  or  ventral  ( positive )  from 
this  BW40„,„.  B  and  D:  same  for  case  SUTCI6  with  single-neuron  BW40s  depined  by  crosses.  BW40„„  for  the  2  case  is 
aligned  (see  C  and  D).  and  then  the  single  neurons  ore  superimposed  within  the  same  coordinate  frame  ( £)  to  yield  the 
pooled  result. 


bandwidths  that  gradually  increased  with  increasing  dis¬ 
tance. 

The  pooled  distribution  of  BW40  for  single  neurons  ( Fig. 
7  B )  shows  some  similarities  and  some  dissimilarities  to  the 
multiple-unit  distribution.  Doisal  to  the  BW40  minimum, 
single  units  show  the  same  tendency  of  increasing  band¬ 
width  with  dorsal  distance  as  seen  with  multiple  units.  Par¬ 
tially  responsible  for  this  overall  increase  in  bandwidth  is 
the  occurrence  of  multipeaked  FRAs  in  the  dorsal  aspect  of 
AI  (Fig.  IB,  +)  in  accordance  with  a  previous  report  of 
their  spatial  distribution  ( Sutter  and  Schreiner  1991a).  Sin¬ 
gle-peaked  FRAs  in  dorsal  Al  also  tended  to  have  broader 
tuning.  However,  some  sharply  tuned  single-  and  multiple- 
neuron  responses  can  be  found  across  the  entire  dorsal  half 
of  A 1  or  at  least  up  to  3  mm  dorsal  to  the  bandwidth  min¬ 
imum. 

In  the  AI  portion  ventral  to  the  bandwidth  minunum.  the 
pooled  single-neuron  data  deviated  from  the  trend  seen  in 


multiple-unit  recordings  as  well  as  from  that  seen  in  the 
single-neuron  example  shown  above  ( Fig.  4 );  BW40  of  al¬ 
most  all  ventral  single  neurons  remained  essentially  sharply 
tuned  ( <0.7  octaves)  independent  of  their  location  in  ven¬ 
tral  AI.  By  contrast,  the  multiple-unit  recordings  showed 
increasing  bandwidths  with  disunce  from  the  BW  mini¬ 
mum.  The  location  of  the  AI/ All  border  varied  from  1.65 
to  3.45  mm  from  BW40„,„,  and  no  single  neurons  located 
within  All  were  included.  However,  even  locations  as  close 
as  1  mm  ventral  to  the  BW40  minimum,  and  therefore 
located  well  within  Al.  showed  multiple-unit  locations  with 
bandwidths  in  excess  of  I  octave,  whereas  al!  ventral  single 
neurons  remained  below  that  value.  Figure  7C  shows  a  di¬ 
rect  comparison  of  the  single-  and  multiple-unit  scatter- 
grams.  BW40  values  were  binned  and  averaged  over  500 
ttm.  The  mean  BW40  values  of  multiple  (□ )  and  single  ( ■ ) 
units  are  shown.  Sundard  deviations  are  represented  by 
error  bars.  The  mean  and  standard  deviation  of  BW40  in- 
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no.  7.  Pooled  BW40  data.  Reference  for  pooling  iJ  the  most  sharply 
'uiictl  location  40  dB  above  threshold  of  multiple-unit  rcsponics. 
B\V40„„.  .4 ;  distribution  for  all  multiple-unit  recordings,  fl;  distribution 
for  all  sinjile-unit  recordings.  C:  mean  value  of  BW40  over  0.5  mm  bins  for 
multiple  I c )  and  single  (■  I  uniis.  Sundaid  deviauons  are  represented  by 
error  bars.  Dashed  lines  in  .4  and  B  correspond  to  the  regression  tines 
derived  independently  for  the  areas  dorsal  and  ventral  of  BW_. 


creased  gradually  toward  the  dorsal  end  of  Al  for  multiple 
and  single  units.  Ventrally.  mean  and  standard  deviation  of 
BW40  for  single  neurons  remained  consunt.  whereas  tlic 
multiple-unit  values  increased  toward  and  into  All. 

The  gradients  of  the  bandwidth  progre^ons  dorsal  and 
ventral  to  the  bandwidth  minimum  can  be  approximated 
by  regression  lines  as  is  shown  by  the  dashed  lines  in  Fig.  7, 
■•1  and  B.  For  the  dorsal  portion  of  the  multiple-unit  distri¬ 
bution.  the  regression  line  is  described  by  y  -  -0.25,'c  +■ 
0.42  (r  =  0.49).  The  slope  of  the  dorsal  gradient  is  -0.25 
octaves/ mm.  The  95%  confidence  interval  of  the  slope  lies 
between  -0. 14  and  -0.38  ocuves/mm.  The  regression  line 
of  the  ventral  multiple-unit  BW40  gradient  was  y  “ 
0.23.V  +  0.47  (r  -  0.38).  The  slope.  0.23  octaves/mm.  is 
similar  although  slightly  less  than  that  of  the  dorsal  gra¬ 
dient.  The  larger  scatter  on  the  ventral  side  is  reflected  in  a 
95%  confidence  interval  containing  slopes  between  0.09 
and  0.36  octaves/ mm.  For  both  sides  of  the  distribution, 
the  zero  slope  lies  outside  of  the  99.5%  confidence  intervals. 
Therefore  it  is  concluded  that  th(.  multiple-unit  BW40  dis¬ 


tribution  along  the  dorsovcntral  a.\is  of  Al  is  concave  with  a 
minimum  ( V  shaped ) . 

Contrasting  the  multiple-unit  distribution,  the  single- 
unit  topography  (  Fig.  7  B 1  appeared  to  be  essentially  flat  on 
the  ventral  side.  The  slope  of  the  regression  line  (  r  « 
0.05 .V  *  0.28;  r  =  0.03)  was  0.05  octaves/ mm.  The  95'T 
confidence  intervals  comprise  slopes  between  -0.04  and 
0.13  octaves/mm.  which  includes  zero  slope.  Therefore  it 
cannot  be  claimed  with  95%  confidence  that  the  slope  is 
greater  than  zero.  Dorsally.  the  slope  of  the  average  c.xcit- 
atory  bandwidth  was  -0.28  octaves/ mm  l.i'  ■  -0.28, v  - 
0.28;  r  -  0.48;  95%  confidence  interval;  —0.13  to  -0.42 
octaves/ mm),  that  is.  similar  to  the  dorsal  slope  obtained 
for  multiple  units. 

As  apparent  from  Fig.  5.  case  SUTC16  showed  some  sys¬ 
tematic  variation  of  CF  along  the  dorsovcntral  a.xis  of  Al. 
This  was  partially  due  to  some  obscurity  in  the  exact  orien¬ 
tation  of  isofrequenc^  contours  during  the  experiment  and 
to  the  usually  encountered  caudal  deviation  of  isofrequency 
contours  in  the  doisal  portion  of  Al.  To  test  whether  the 
bandwidth  changes  in  the  dorsovcntral  direction  of  Al  were 
indeed  a  spatial  characteristic  and  were  not  just  an  effect  of 
the  sampled  frequency  distribution  along  that  axis,  a  regres¬ 
sion  analysis  of  BW46  with  dorsoventral  location  as  well  as 
with  CF  was  conducted  for  the  dorsal  and  the  ventral  por¬ 
tion  of  all  the  obtained  maps.  The  variance  in  the  BW40 
data  that  could  be  attributed  to  changes  in  the  dorsoventral 
locations  was  2 1 4  ( ventral )  and  3 1 8%  f  dorsal )  larger  than 
the  variance  attributable  to  CF  changes.  This  suggests  that 
the  BW  changes  seen  in  the  dorsoventral  extent  of  Al  arc 
also  present  within  isofrequency  contouts. 

Parceling  oJ'Al:  regional  (inferences  in  BW40 

To  test  the  hypothesis  that  the  dorsovcntral  extent  of  .Al 
may  consist  of  more  than  one  physiologically  distinguish¬ 
able  region,  Al  was  subdivided  into  three  regions  on  the 
basis  of  the  bandwidth  distribution;  1  mm  to  each  side  of 
the  location  of  minimum  bandwidth  was  arbitrarily  defined 
as  the  “central”  region  of  Al;  dorsal  and  ventral  to  this  area 
were  defined  correspondingly.  The  minimum  bandwidth 
measure  used  to  pool  data  for  this  analysis  was  the  location 
of  BW40n„„  and  BW10„,„  averaged  for  each  particular  case 
(BW„,„).  This  measure  was  used  so  that  the  same  topo¬ 
graphical  parceling  can  be  performed  without  biasing  to¬ 
ward  either  bandwidth  uicasuic.  Figure  8  shows  the  histo¬ 
grams  of  BW40  for  the  dorsal,  ventral,  and  central  regions 
of  Al  for  single  (■)  and  multiple  (□)  units.  Median  values 
for  the  hist(3grams  are  summarized  in  Fig.  8C.  Deviation 
bars  represent  one-half  of  the  75th  percentile  BW  value 
minus  the  25th  percentile  value. 

Table  1  lists  the  mean  BW40  values  found  with  single- 
and  multiple-unit  recordings.  A  significant  difference  be¬ 
tween  the  BW40  disttibutions  of  multiple  and  single  units 
in  the  ventral  and  central  parts  of  Al  was  established 
( nonparametric  Mann- Whitney  U  test.  P  <  0.0 1.  see  Ta¬ 
ble  2).  In  dorsal  Al.  no  significant  difference  was  found 
(P>0.05). 

Analysis  of  variance  (ANOVA)  was  applied  to  test  for 
differences  in  bandwidth  values  of  the  three  regions.  Signifi- 
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FIG.  8,  Disinbution  of  BW40  in  3  retioni  of  the  dortoventrai  dimension  of  A1  ( chanctehstic  frequency.  >4  kHx). 
Central  rction  includes  area  I  nimtobothsjdesorBW,„„foraioulof2mm.  Dorsal  region  comprises  all  of  AS  >1  mm  dorsal 
to  BW..,,, .  and  ventral  region  >  I  mm  ventral  to  BW.,., .  .1 :  histograms  of  BW40  values  for  single  neurons  in  Al.  B‘.  BW40  for 
multiple-unit  recordings.  C :  mean  values  ( bars  I  and  sundard  devutions  ( error  bars  I  of  BW40  values  shown  in  .4  and  B. 


cant  differences  ( P  <  O.Ol)  between  regions  were  found  for 
both  single-  and  multiple-unit  B W40  values  ( Table  3 ) .  Post 
hoc  Sheffe  analysis  showed  that  the  ANOVA  results  for 
multiple  units  were  due  to  differences  between  the  central 
and  ventral  regions  as  well  as  between  central  and  dorsal 
differences  for  multiple  units.  For  single  units.  ANOVA 
differences  could  be  accounted  for  by  dorsal  versus  central 
and  dorsal  versus  ventral  regional  differences.  Nonparamet- 
ric  Mann-Whitney  U  tests  confirmed  the  Sheffe  results. 

Bifi  'IO  Topography 

The  bandwidth  or  Q-value  10  dB  above  threshold  is  offen 
less  appropriate  for  a  description  of  sharpness  of  frequency 
tuning  of  single  cortical  neurons  than,  for  example,  Q40dB. 
because  some  influences  that  shape  FRAs  are  not  fully  acti¬ 
vated  at  stimulus  levels  near  minimum  threshold  ( Sup  and 
Manabc  1982;  Sup  and  Tsuzuki  1985).  However,  because 
this  measure  has  been  almost  exclusively  used  in  cortical 
studies  of  species  other  than  bats  and  it  does  show  some 
systematic  spatial  variations  in  multiple-unit  recordings 
(Schreiner  and  Mendelson  1990;  Sutter  and  Schreiner 
199  la ).  it  was  analyzed  here  as  well  to  allow  comparisons 
with  other  studies. 


Scattergrams  of  BWIO  foi  single  and  multiple  units  arc 
shown  in  Fig.  9.  Locations  were  pooled  such  that  BWlO„,n 
was  assigned  the  distance  reference  value  of  zero  for  each 
case.  Although  the  multiple-unit  distribution  appeared  to 
be  V  shaped,  paralleling  the  multiple-unit  BW40  distribu¬ 
tion,  such  an  effect  appeared  to  be  less  pronounced  in  the 
distribution  of  BWlOforsingle  neurons.  For  both  multipie- 
and  single-unit  recordings,  FRAs  narrowly  tuned  10  dB 
above  minimum  threshold  were  encountered  throughout 
the  entire  dorsoventral  dimension  of  A  I.  Dorsal  to  the 
bandwidth  minimum,  the  slope  of  the  regression  line  was 
-O.lO  octaves/mm  (dashed  line;  y  -  -O.lO.x  +  0.14;  r  - 
0.5;  95%  confidence  interval;  -0.06  to  - 1 . 1 5  ocuves/ mm) 
for  multiple  units.  Ventrally,  the  slope  was  0. 1 1  octaves/ 
mm  (y  -  O.l  Lx  +  0.16;  r  -  0.37;  95%  confidence  interval; 
0.05-0.18  octaves/ mm).  For  both  dorsal  and  ventral 
halves,  zero  slope  fell  outside  of  the  99.5%  confidence  inter¬ 
val,  indicating  that  the  distribution  is  V  shaped  with  99.5% 
confidence  and  thus  is  similar  to  the  BW40  distribution. 

Regression  analysis  of  single-unit  BWIO  topography 
(dashed  lines  in  Fig,  9  5)  did  not  show  slopes  significantly 
different  from  zero  cither  dorsally  or  ventrally.  Dorsaiiy. 
the  slope  was  -0.03  ocuves/ mm  (y  ■  -0.03. v  +  0.19:  r  - 
0.14;  95%  confidence  interval:  -0.08-0.02  ocUves/mm). 


TABLE  I .  FR.4.  bandwidth  measure  for  single-  and  multiple-unit  recordings  in  Al 


Total 

Donoi 

Central 

Ventral 

BW40.  MU 

0.80  a:  0.56  1146) 

0.98  1 0.59  (45) 

0.57  1  0.42  (59) 

0.95  tO.59  (42) 

BW40.  SU 

0.50  t  0.46  (99) 

0.88  10.61  (29) 

0.31  10.24(46) 

0.40  10.25  (241 

BWIO.  MU 

0,30  i  0.24  (146) 

0.35  10.23  (45) 

0.22  lO  14(59) 

0,37  1 0.32  (421 

BWIO.  SU 

0.20  10.17(103) 

0.26  10.21(31) 

0.19  10.14(47) 

0.16i0.14i25l 

Values  arc  means :::  SD:  number  of  neuroiu  are  in  parentheses.  Baitdwtdth  expreued  in  octaves.  The  central  region  represents  a  2-mm  sector  of  the 
dorsoventral  extent  of  Al  centered  at  BW...,.'.  the  dorsal  and  ventral  regions  cover  the  remainder  of  Al.  FRA.  frequency'  response  area:  Al.  pnmary 
auditory  conex;  BW40  and  BW|0.  bandwidth  40  and  10  dB  above  minimum  threshold,  respectively.  MU.  multiple  units:  SU.  single  units. 
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TABLE  2.  Ci'iupansim  ni  FR.i  hund^udili  for  'uncle-  etnd 
imdtiple-imt  measures 


Dorsal 

Central 

Ventral 

BW4(J.  MU'  vs.  SU 

0-51 

O.OOH 

0.0001 

Bvy  lU.  MU  vs.  SU 

0.034 

0.049 

0.0001 

P  values  from  Mann-Whitney  f  tests.  FRA.  frequency  response  area; 
BWao  and  BWlO.  band^^idttt  40  and  10  dB  above  minimum  ihicsitold. 
respectiveU;  MU,  multiple  units;  SL'.  single  unit-; 


Vcntrally.  the  gradient  was  0.01  octaves  /  mm  (r  =  0.0 l  .v  + 
u. lb;  /•  =  0.06:  95%  confidence  interval:  -0.04  and  0.06 
octaves/ mm  I,  it  is  concluded  that  the  sharpness  of  fre¬ 
quency  tuning  10  dB  above  minimum  threshold  does  nos 
change  systematically  for  single  neurons  across  the  dorso- 
ventral  extent  of  AI. 

FarceUng  ofAl:  regional  differences  in  BH'IO 

For  BWlO.  AI  was  parceled  as  described  for  BW40.  His¬ 
tograms  are  shown  for  single  (Fig.  lO.-l.  ■)  and  multiple 
( Fig.  lOfl.  □!  units  in  dorsal,  central,  and  ventral  AI.  The 
summary  histogram  of  medians  is  shown  in  Fig.  IOC.  The 
mean  BW 10  values  for  the  three  regions  are  given  in  Table 
I .  There  was  a  significant  difference  between  BW  1 0  for  sin¬ 
gle  and  multiple  units  in  the  ventral  ( P  <  0.0 1 )  and  dorsal 
( P  <  0.05 )  regions  of  AI  ( see  Table  2 ) .  Centrally,  there  was 
no  significant  difference  fbr  BW  10  between  single  and  mul¬ 
tiple  units  (P>  0.05). 

For  multiple  units,  ANOVA  revealed  significant  differ¬ 
ences  among  dorsal,  ventral,  and  central  regions  ( Table  3 ). 
Sheffc  post  hoc  tests  showed  the  significant  ANOVA  was 
due  to  differences  between  the  dorsal  and  central  region 
and  between  the  central  and  ventral  region.  Mann-Whitney 
I'  t'^ts  confirmed  the  Sheffe  results  (Table  2).  For  single 
units.  ANOVA  ( confirmed  by  Kruskal- Wallis  test)  showed 
no  significant  differences  in  BWlO  among  central,  ventral, 
and  dorsal  regions  in  AI  (Table  3). 

Scatter  of  CF  in  dorsoventral  extent  of  AI 

The  apparent  discrepancy  between  single-  and  multiple- 
unit  estimates  of  the  excitatory  bandwidth  in  the  ventral 
portion  of  Al  suggests  the  influence  of  other  physiolo^cal 
parameters  than  single-unit  bandwidth  on  ^e  multiple- 


TA5LE  3.  Gotfipanson  of  FRA  bandwidth  measures  between 
different  sectors  of.Al 


Donal  vs. 
Central 

Central  vs. 
Ventral 

Dorsal  vs. 
Ventral 

BW40.  MU 

0.000 1* 

0.000  !• 

0.64 

BW40.  SU 

0.000 1* 

0.13 

0.003* 

BWlO,  MU 

o.ooost 

0.00 1  • 

0,96 

BWlO.  SU 

0.20 

0.50 

0.10 

P  values  from  Mann-Whitney  V  tesu.  FRA.  frequency  response  area; 
At.  pnmary  auditory  cortex:  BW40  and  BW  10.  bandwidth  40  and  10  dB 
above  minimum  threshold,  respectively;  MU.  multiple  units;  SU.  single 
units,  •Ninety-nine  percent  significance  in  Shefli  test.  tNincty-fivc  per¬ 
cent  signihcance  in  Shefli  test. 
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fic.  9.  Pooled  BWlO  dau.  Reference  for  pooling  i.<  the  most  sharply 
tuned  location  10  dB  above  minimum  threshold  of  multiple-unit  re¬ 
sponses.  BW  1 .  A :  distribution  for  all  muttiple-unii  recordings.  B ;  dis¬ 
tribution  for  ail  single-unit  recordings.  C;  mean  value  of  BWio  over  0.5 
mm  bins  for  multiple  (a  I  and  sin^e  im)  units.  Sundard  deviations  are 
represented  by  error  bats.  Dashed  lines  correspond  to  the  regression  lines 
independently  derived  for  the  areas  dorsal  and  ventral  orBWt0„,.. 


unit  result.  More  specifically,  it  indicates  that,  near  mini¬ 
mum  threshold  of  multiple-unit  recordings,  there  are  top>o- 
graphical  dificrences  in  CF  scatter  and/or  scatter  in  the 
minimum  threshold  of  the  contributing  single  neurons, 

As  a  preliminary  test  of  the  hypothesis  of  a  varying  degree 
of  CF  scatter  across  AI.  a  post  hoc  analysis  of  single  neuron 
CFs  reiaiive  to  multiple-unit  CFs  was  conduaed.  The  CF 
scatter  was  determine  by  one  of  two  meth(xls.  If  a  multi¬ 
ple-unit  CF  was  available  for  the  same  cortical  penetration 
as  that  of  a  single  neuron,  the  difference  berween  the  two 
CFs  was  determined  (both  single-  and  multiple-unit  re¬ 
sponses  were  obtained  in  the  same  range  of  cortii^  depth: 
see  METHODS).  For  single-neuron  locations  that  did  not  co¬ 
incide  with  a  penetration  that  yielded  a  multiple-unit  mea¬ 
surement  but  were  located  between  at  least  two  multiple- 
unit  recording  locations,  the  multiple-unit  CF  at  the  loca¬ 
tion  of  the  single  neurons  was  estimated  through  linear 
interpolation/ triangulation  from  the  neighboring  record¬ 
ing  locations.  The  difference  between  the  estimated  multi¬ 
ple-unit  CF  and  the  actual  single-neuron  CF  was  obtained. 
Some  single  neurons  ( n  «  7 )  had  to  be  excluded  from  this 
analysis  because  experted  CF  values  could  not  be  deter- 
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FIG.  10.  Distribution  of  BWIO  in  3  regions  of  tbs  donoventra!  dimension  of  AI  ( chancterstic  frequency,  >4  ItHz). 
Central  region  includes  area  1  mm  to  both  sides  of  BW„„  for  a  total  of  2  mm.  Dorsal  region  romptisesallofAI>  I  mmdoisal 
to  BW^„.  and  ventnl  region  >  I  mm  ventral  to  BW„„..4  histogmmsof  BWIO  values  for  single  neurons  in  AI.  B-.  BWIO  for 
multiple-unit  recordings.  C:  mean  values  (tsars  land  standard  deviations  (error  bars)  of  BWIO  va.ues  shown  in.-l  and  B. 


mined  because  of  lack  of  aopropriately  positioned  multiple- 
unit  recording  sites  ( <500-tim  distance  from  the  single-unit 
site).  In  Fig.  11.  the  aaual  and  estimated  CFs  for  case 
SUTC16  are  plotted  for  multiple  units  (A )  and  single  neu- 


KiG.  II.  Predicted  v$.i«ctual  characteristic  frequencies  (CFs  I  for  multi¬ 
ple  and  single  units  of  case  SUTCib.  The  predicted  CFs  were  denved 
th’-ough  interpolation  Irom  several  multiple-unit  CFs  closest  to  a  given 
recording  sue.  Dashed  lines  indicate  identity  of  predicuon  and  aaual  CF. 
Formulas  of  the  regression  lines  and  the  corresponding  correlation  coefli- 
c'lcnu  are  displayed. 


rons  (B).  Tlie  dashed  lines  correspond  to  CF  estimates  that 
are  identical  to  the  actual  CF.  The  apparent  difference  in 
CF  scatter  for  multiple  and  single  neurons  is  confirmed  by  a 
regression  analysis  yielding  correlation  coefficients  of  0.89 
for  multiple  units  and  0.35  for  single  units.  The  average 
deviation  of  the  actual  CF  from  the  predicted  values  was 
0.33  ±  0.51  (SD)  octaves  for  single  uniu  compared  with 
0. 12  ±  0.09  octaves  for  multiple  units  ( F  -  4.53;  P  <  0.04 ) . 
The  spatial  disiriburion  of  the  CF  scatter  for  single  neurons 
is  shown  in  Fig.  12.  The  largest  local  CF  deviations  are 
located  at  the  ventral  and  dorsal  extremes  of  the  mapped 
area. 

The  resulting  mean  CF  scatter,  expressed  in  octaves,  for 
all  cases  (96  single  neurons)  is  shown  in  Fig.  13  as  seven 
1-mm-wide  sectors  across  the  dorsoventral  extent  of  AI. 
Near  the  bandwidth  minimum  (at  0  mm),  the  obtained 
mean  CF  scatter  was  smallest.  More  dorsally  and  especially 
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FIG.  12.  Donovenu^  dinributton  of  characteristic  frequency  (CF) 
scatter.  The  difference  of  the  predicted  CFs  and  the  aaual  CFs  of  single 
units  (SUTC16).  expressed  in  octaves,  are  plotted  as  a  funaion'of  the 
dorsoventral  location. 
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no.  1 3.  Estimated  scatter  of  characteristic  frequency  (CF)  as  a  func¬ 
tion  of  cortical  location  for  ‘^6  out  of  103  single  units.  CF  scatter  -  isingle- 
neuron  CF  -  expected  CF| .  Values  were  then  averaged  over  I  mm  of  the 
dorsoventral  extent  of  A!  to  derive  a  mean  CF  scatter  value.  Number  of 
neurons  in  each  of  the  7  bins  ( from  dorsal  to  ventral )  were  *5. 13. 1 2. 28. 20. 
8.  and  6.  respectivelv.  Eiror  bars  indicate  standard  error  of  the  mean. 

more  ventrally.  the  mean  CF  scatter  increased  substan¬ 
tially.  The  range  of  CF  scatters  at  different  locations  along 
the  dorsoventral  axis  of  AI  was  slightly  smaller  than  the 
average  BWIO  range  for  multiple  units  at  corresponding 
locations  (for  comparison,  sec  Fig.  9C). 

DISCUSSION 

In  this  scries  o‘‘ experiments,  the  bandwidth  of  FRAs  10 
and  40  dB  above  minimum  response  threshold  was  used  to 
investi^tc  1 )  how  single-neuron  response  properties  relate 
to  physiological  “gradients”  obtained  with  multiple-unit  re¬ 
cording  techniques,  and  2)  whether  pooling  of  data  across 
animals  can  be  successfully  used  to  reveal  general  physiolog¬ 
ical  topographies.  The  main  findings  of  this  study  can  be 
summarized  as  follows.  I )  Multiple-unit  bandwidth  data, 
pooled  along  the  dorsoventral  axis  of  AI,  reflect  the  same 
spatial  distribution  as  previously  seen  in  individual  cases 
(Schreiner  and  Mendelson  1990;  Sutter  and  Schreiner 
1991a).  namely,  a  region  '-1.5-3  mm  dorsal  to  the  All 
border  contain^  narrow  integrated  cxcitatoiT  bandwidths 
gradually  giving  way  to  more  broadly  tuned  responses  dor- 
sally  as  well  as  ventrally.  2)  Excitatory  bandwidths  of  single 
neurons,  pooled  relative  to  the  multiple-unit  bandwidth  to¬ 
pography.  significantly  differed  from  the  multiple-unit  dis¬ 
tribution  in  the  region  of  AI  ventral  to  the  bandwidth  mini¬ 
mum.  that  is,  no  clear  gradient  in  sharpness  of  single- 
neuron  responses  was  evident  for  cither  BW 10  or  BW40;  in 
the  region  of  AI  dorsal  to  the  minimum  in  multiple-unit 
bandwidth,  a  spatial  gradient  as  well  as  an  increasing  vari¬ 
ance  was  evident  for  the  BW40  of  single  neurons  similar  to 
the  one  obtained  for  multiple  units,  however,  no  clear  gra¬ 
dient  was  apparent  for  BWlO  of  single  neurons.  J)  Predic¬ 
tions  of  the  CF  of  a  given  cortical  location  from  the  CF  of 
neighboring  multiple-unit  CFs  resulted  in  a  good  agree¬ 
ment  with  the  aaual  multiple-unit  CFs.  By  contrast,  a  sub¬ 
stantial  deviation  of  single-unit  CFs  from  multiple-unit  es¬ 
timates  was  seen  for  single  neurons  located  near  the  ventral 
and  dorsal  ends  of  AI. 


.3/ eihudological  considerations 

Before  discussing  some  implicauons  of  these  findings  for 
the  interpretation  of  the  functional  organization  of  pnmary 
auditory  cortex,  a  brief  look  at  the  pooling  method  is  in 
order.  Single-  and  multiple-unit  properties  were  pooled 
along  the  dorsoventral  extent  of  .Al.  approximating  the  ori¬ 
entation  of  the  isofrcquency  axis  of  AI.  The  location  of  the 
minimum  in  multiple-unit  bandwidth  cither  10  or  40  dB 
above  minimum  response  threshold  was  used  as  a  reference 
point  for  the  pooling  of  data,  allowing  an  alignment  of  iso- 
frequency  contours  from  different  cortexes.  The  use  of  u 
single  point  of  reference  plus  the  orientation  of  the  contours 
docs  not  involve  normalization  of  the  length  of  the  dorso¬ 
ventral  extent  of  AI.  Rcale  and  Imigf  1980)  gave  estimates 
of  the  dorsoventral  extent  of  AI  ranging  from  4  to  ”  mm. 
However,  precise  anatomic  or  physiological  criteria  for  es¬ 
tablishing  the  extent  of  that  dimension  are  not  available 
( see  Middlebrooks  and  Zook  1983;  Schreiner  and  Cynader 
1984;  Schreiner  and  Mendelson  1990;  Sutter  und  Schreiner 
1 99 la:  Winer  1984).  Therefore  no  attempts  at  e.xtent  nor¬ 
malization  have  been  made.  A  potential  consequence  of 
differences  in  the  absolute  extent  of  spatial  gradients  may¬ 
be  a  blurring  of  the  topographies  at  the  dorsal  and  ventral 
extremes  of  the  pooled  data  ( see  below ) . 

The  pooling  method  did  not  take  into  account  the  actual 
CF  of  the  units.  The  question  is.  then,  could  CF  sampling 
biases  have  strongly  affected  the  observed  spatial  band¬ 
width  distributions?  It  has  been  reported  that,  with  increas¬ 
ing  CF.  Q-faaors  of  cortical  neurons  show  an  overall  in¬ 
crease  and  relative  bandwidth  values  decrease  ( Phillips  and 
Irvine  1981).  Previous  studies  (Schreiner  and  Mendelson 
1 990;  Sutter  and  Schreiner  1 99 1  a )  had  shown  a  similar  dor¬ 
soventral  pattern  of  tuning  sharpness  across  all  frequencies, 
at  least  for  the  CF  range  >4  kHz.  which  is  the  same  range 
covered  in  the  current  study.  In  some  of  the  cases  included 
in  this  study,  in  particular  case  SUTC16.  CFs  at  locations 
near  the  ventral  and  dorsal  end  of  Al  tended  to  deviate  from 
the  projected  isofrequency  values  toward  higher  frequen¬ 
cies  (see  Fig  5).  These  were  also  the  regions  that  showed 
the  widest  integrative  excitatory  bandwidth.  Because  neu¬ 
rons  with  higher  CFs  also  tend  to  have  narrower  band- 
widths.  a  potential  CF-influence  would  tend  to  diminish 
the  magnitude  of  the  observed  bandwidth  differences  be¬ 
tween  the  dorsoventral  center  and  the  dorsal  and  ventral 
margins  of  AI  providing  an  aigurnent  against  a  substantial 
CF  influence  on  the  presented  data.  In  addition,  compari¬ 
son  of  the  variance  in  the  bandwidth  values  that  could  be 
attributed  to  either  the  dorsoventral  location  of  AI  or  the 
CF  also  suggested  a  relatively  .imail  systematic  influence  of 
CF  distribution  on  the  results.  This  supports  the  conclusion 
that  the  observed  global  excitatory  bandwidth  variations 
can  also  be  found  along  more  strictly  defined  isofrequency 
contours. 

Finally,  in  interpreting  this  data,  it  is  important  to  re¬ 
member  that  the  single  and  multiple  neuron  recordings 
from  these  experiments  were  limited  to  depths  between  600 
and  l.OOO  mhi  below  the  cortical  surface.  Therefore  the  ore- 
sented  spatial  distribution  of  single-neuron  properties  may 
be  limited  to  a  ponton  of  AI  columns,  approximately  corre¬ 
sponding  to  deep  layer  3  and  layer  4. 
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SinvU'-iinii  versus  muinple-tmit  hamiwidih 

The  multiple-unit  measurements  of  sharpness  of  tuning 
(e\citator>  bandwidth  10  and  40  dB  above  minimum 
threshold  I  confirm  the  spatial  pattern  in  the  dorsoventral 
dimension  of  Al  as  reported  for  corresponding  Q-values 
(Schreiner  and  Mendclson  1990:  Sutter  and  Schreiner 
1991a).  All  isofrequency  contours  contain  a  region  of 
narrow  integrated  excitatory  bandwidths  with  a  gradual 
transition  to  more  broadly  tuned  locations  toward  the  ven¬ 
tral  and  dorsal  ends  of  Al.  in  other  words,  the  excitatory 
contributions  that  are  available  at  different  cortical  loca¬ 
tions  along  the  isofrcquency  domain  appear  to  vary  system¬ 
atically.  What  is  the  source  of  this  gradation  in  the  inte¬ 
grated  e.xcitatory  bandwidth?  Pooled  single-neuron  data 
dorsal  to  the  minimum  in  integrated  bandwidth  40  dB 
above  threshold  showed  units  with  progressively  broader 
bandwidths  intermingled  with  sharply  tuned  neurons.  Lo¬ 
cal  integration  of  activity  from  &*veral  neurons  can  account 
for  the  observed  dorsal  bandwidth  gradient  as  reflected  in 
similar  slopes  of  the  linear  regression  for  multiple-  and  sin¬ 
gle-unit  BW40  (Fig.  7).  The  occurrence  of  neurons  with 
multipeaked  tuning  curves  in  dorsal  Al  (Sutter  and 
Schreiner  1991a)  contributed  to  the  broadening  of  the  mul¬ 
tiple-unit  BW40.  However,  neurons  with  a  single  excitatory 
response  area  also  broadened,  occasionally  exceeding  the 
total  bandwidth  of  multipeaked  neurons  (Fig.  7).  Broad 
single-peaked  neurons  did  not  necessarily  have  to  reside  in 
close  proximity  to  recorded  multip^ked  neurons. 

Ventral  to  the  minimum  in  integrated  bandwidth, 
pooled  BW40  values  of  single  neurons  did  not  reflect  the 
same  trend  as  seen  for  the  integrated  bandwidth.  All  single 
neurons  in  the  ventral  3.5  mm  of  Al  remained  essentially 
sharply  tuned,  that  is.  below  a  bandwidth  of  1  octave.  By 
contrast.  ~25%  of  the  multiple  units  in  that  area  had  band- 
widths  >  1  octave.  Accordin^y.  no  bandwidth  gradient  was 
observed  for  the  pooled  single-neuron  data.  Some  aspects  of 
the  pooling  method  may  have  contributed  to  this  discrep¬ 
ancy  between  multiple-  and  single-unit  measurements.  Sin¬ 
gle  units  that  were  located  in  All.  as  defined  by  multiple- 
unit  criteria  (Schreiner  and  Cynader  1984).  were  not  in¬ 
cluded  in  the  pooled  data.  A  strict  definition  of  the  AI/AII 
border  may  have  biased  the  single-unit  distribution  toward 
narrower  bandwidths  at  the  ventral  end  of  Al.  The  distance 
between  the  bandwidth  minimum  and  the  border  of  All 
could  vary  over  >2  ram,  even  within  a  single  case.  Because 
the  length  of  Al  was  not  normalized  for  the  pooling,  a  func¬ 
tional  gradient  between  BW40  minimum  and  All  could  be 
obscured  because  of  the  averaging.  Indeed,  in  one  of  the 
illustrated  single-unit  cases  (sec  Fig.  4).  a  BW40  gradient  is 
apparent  in  ventral  Al.  However,  the  single-unit  gradient 
appears  to  be  somewhat  shallower  than  the  multiple-unit 
gradient.  Additionally,  even  at  ventral  distances  as  close  as 
1  mm  from  the  bandwidth  minimum,  that  is  well  within 
Al.  the  bandwidth  discrepancy  between  single-  and  multi¬ 
ple-unit  recordings  was  apparent.  There  are  reasons  to  be¬ 
lieve.  then,  that  the  lack  of  congruence  in  BW40  for  single 
and  multiple  units  in  ventral  Al  is  not  solely  caused  by 
potential  methodological  influences  but  is  a  reflection  of 
physiological  properties  that  distinguish  ventral  from  dor¬ 
sal  Al  { see  below). 


Discrepancies  between  single-  and  multiple-unit  re¬ 
sponses  were  more  pronounced  for  the  measure  of  BW  lO 
Although  the  multiple-unit  BW  10  data  showed  similar  gra¬ 
dients  along  the  dorsoventral  extent  as  seen  for  BW40  in 
this  study  and  for  corresponding  Q-values  in  previous  stud¬ 
ies  (Schreiner  and  Mcndelson  1990;  Sutter  and  Schreiner 
1991a).  pooled  BW  1 0  values  for  single  neurons  showed  no 
statistically  significant  bandwidth  gradients  in  either  ven¬ 
tral  or  dorsal  Al.  The  difTcrcncc  between  single  and  multi¬ 
ple  units  v/as  most  strongly  expressed  in  ventral  Al  with 
constant  average  bandwidths  across  the  most  ventral  3  mm 
of  Ai.  In  the  region  dorsal  to  the  multiple-unit  BWlO  mini¬ 
mum.  single-neuron  BW  10  values  did  show  a  trend  similar 
to  the  multiple-unit  data  (sec  Fig.  90  and  paralleling  the 
BW40  results,  however,  without  reaching  statistical  signiti- 
cance.  Contributing  to  the  single/ multiple-unit  discrep¬ 
ancy  for  BWIO  in  the  dorsal  region  of  Al  was  the  fact  that 
the  different  peaks  in  multipeaked  tuning  curves  show  dif¬ 
ferences  in  the  minimum  threshold  of  each  peak  (Sutter 
and  Schreiner  1991a).  Consequently,  some  of  those  units 
are  represented  by  the  bandwidth  of  only  one  peak  and  not 
the  total  bandwidth  across  all  peaks,  because  minimum 
threshold  of  those  additional  peaks  have  not  been  reached 
10  dB  above  the  minimum  threshold  of  the  most  sensitive 
peak. 

It  is  concluded  that  sharply  tuned  neurons  can  be  found 
across  the  entire  dorsovenu^  extent  of  AI.  The  si>atial  varia¬ 
tion  in  integrated  bandwidth  in  the  dorsal  region  of  AI  is 
paralleled  by  an  increase  in  the  scatter  of  single-unit  band¬ 
width  and  by  an  increasing  occurrence  of  multipeaked  neu¬ 
rons  toward  the  dorsal  end  of  Al.  By  contrast,  the  increase 
of  integrated  bandwidth  toward  the  All  border  is  not  neces¬ 
sarily  paralleled  by  an  increase  in  single-unit  bandwidth.  In 
the  approximate  dorsoventral  center  of  AI  is  a  region  of 
sharply  tuned  single-  as  well  as  multiple-unit  responses. 

Influence  ofCF  scalier  on  bandwidlh  distribuiion 

The  apparent  discrepancy  between  single-  and  multiple- 
unit  estimates  of  the  cxciuitory  bandwidth  in  the  ventral 
portion  of  AI  was  suggestive  of  the  presence  of  a  change  in 
the  local  scatter  of  vTFs  especially  from  central  AI  to  the 
ventral  boundary.  The  physiological  dorsoventral  center  of 
Al.  dehned  by  a  narrow  multipie-unit  bandwidth,  can  only 
show  little  CF  scatter  of  single  neurons  because  single-  and 
multiple-unit  responses  are  equally  sharply  tuned.  An  in¬ 
creasing  scatter  in  CF  toward  the  dorsal,  and  especially  ven¬ 
tral.  borders  of  AI  could  account  for  the  observed  increase 
in  multiple-unit  bandwidth. 

Although  a  proper  estimate  of  the  distribution  of  single¬ 
unit  CF  scatter  in  AI  should  be  done  by  analyzing  several 
single  neurons  at  each  sampled  cortical  location  (e.g.,  Hui 
et  al.  1989),  the  currently  applied  method  of  comparing 
multiple-unit  CFs  with  single-neuron  CFs  should  provide  a 
reasonable  approximation.  The  main  justifications  are  that 
/ )  multiple-unit  CFs  are  likely  close  to  the  average  of  the 
contributing  single-neuron  CFs.  and  2 )  multiple-unit  CF 
distributions  across  AI  show  a  very  high  degree  of  cochleo- 
topicity  (e.g..  Mcrzenich  et  al.  1975;  Rcale  and  Imig  1980; 
Schreiner  and  Mcndelson  1990).  Although  isofrcquency 
contours  (as  determined  with  the  multiple-unit  method! 
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are  not  even-  where  as  straight  as  in  the  central  sector  of  Al. 
inlluencesofsystematiccuiNaturesin  the  orientation  ofiso- 
I'requency  contours  le.g..  Sutter  and  Schreiner  1991a)  at 
the  dorsal  and/or  ventral  ends  on  the  CF  scatter  estimate 
arc  likely  to  be  small,  because  the  estimates  are  either  from 
the  same  locations  as  the  multiple-unit  CFs  ( i.e..  are  inde¬ 
pendent  from  the  oneniation )  or  are  denved  from  several 
ne.xt  neighbors,  thus  taking  potential  local  changes  in  orien¬ 
tation  into  account.  The  current  attempt  to  analyze  the 
local  consistency  in  the  CF  of  conical  neurons  indicates 
then  that  the  hypothesized  spatial  change  in  CF  scatter  of 
single  neurons  is  likely  to  be  true.  Combining  the  range  of 
CF  scatter  with  the  range  of  single-neuron  bandwidth  for 
each  Al  sector  results  in  a  good  approximation  of  the  over¬ 
all  e.xcitatory  bandwidth  distribution  obtained  with  multi¬ 
ple-unit  recording. 

How  broad  multiple-unit  responses  can  emerge  from 
narrow  single-neuron  responses  can  be  viewed  by  consider¬ 
ing  multiple-unit  responses  as  the  total  integrated  excit¬ 
atory  response  of  the  entire  recording  site  ( Schreiner  and 
Mendelson  1990).  The  multiple-unit  recording  could  re¬ 
flect  all  inputs,  inhibitory  and  excitatory,  because  this 
method  can  potentially  record  from  somas.  dendrites,  and 
axons  of  different  neurons.  With  this  in  mind,  the  single/ 

lultiple-unit  differences  in  bandwidth  encountered  in 
ventral  AI  might  be  due  to  the  inclusion  of  spikes  from 
proximal  inhibitory  processes  in  the  vicinity  of  a  postsynap- 
tic  neuron  into  the  multiple-unit  response.  Therefore  broad 
excitatory  responses  mi^t  be  due  to  broad  inhibitory  in¬ 
puts  of  narrowly  tuned  ventral  neurons.  If  all  inputs  to  a 
given  cortical  location,  producing  either  inhibition  or  exci¬ 
tation.  were  reflected  in  multiple-unit  recordings,  dorsally 
located  multipeaked  single  neurons  should  be  accompa¬ 
nied  by  broad  single-ipeaked  multiple-unit  recordings  at  the 
same  location,  which,  indeed,  was  often  the  case  ( personal 
observation ). 

Parceling  of  A! 

Dividing  AI  into  three  different  regions  provided  a  direct 
method  of  addressing  questions  regarding  local  differences 
between  single-  and  multiple-unit  bandwidth  measures. 
The  selection  of  three  areas  was  based  on  the  distribution  of 
the  integrated  bandwidth  alone.  The  combined  results  from 
single  and  multiple  units  suggest  that,  physiologically.  AI 
can  be  parceled  into  two  regions:  a  ventral  (AIv)  and  a 
dorsal  ( Aid )  region.  The  bases  of  dividing  these  regions  are 
/ )  a  reversal  of  the  global  gradient  of  integrated  bandwidth 
along  the  dorsovcntral  axis  of  AI,  2)  regional  differences  in 
the  distribution  of  BW40  for  single  neurons,  and  3)  the 
observation  of  neurons  with  multiple-peaked  tuning  curves 
that  appear  to  be  limited  to  the  dorsal  part  of  Al. 

A  significant  difference  in  the  spectral  integration  proper¬ 
ties  of  the  granular  and  supragranular  layers  of  Aid  and  AIv 
implies  that  there  should  be  fundamental  differences  in  the 
spectral  processing  in  the  two  regions.  Functionally,  Aid 
may  be  particularly  well-suited  for  an  integrative  analysis  of 
broadband  stimuli  by  responding  relatively  undifferen¬ 
tiated  to  tones  or  spectral  peaks  of  different  frequencies. 
This  is  reflected  in  generally  broader  excitatory  tuning  of 
single  neurons  as  well  as  by  multipeaked  neurons.  Accord¬ 


ingly.  a  high  responsiveness  to  broadband  stimuli  was  ob¬ 
served  in  Aid  ( Schreiner  and  Mendelson  I990;  Sutter  and 
Schreiner  1991a).  However,  whether  a  true  integrative  pro¬ 
cessing.  e.g..  characterized  by  lower  thresholds  for  broad¬ 
band  signals  compared  with  those  for  tones,  is  present  m 
•Aid  remains  to  be  seen. 

Locations  in  .AIv  may  be  more  suited  for  a  differential 
analysis  of  the  spectral  properties  of  broadband  stimuli  be¬ 
cause  they  arc  characterized  by  sharply  tuned  single  neu¬ 
rons  distributed  over  varying  ranges  of  CFs.  In  other  words, 
details  of  the  spectral  distribution  of  broadband  signals  ap¬ 
pear  to  be  resolved  and  preserved  at  given  locations  in  AIv . 
The  responsiveness  to  broadband  stimuli  (clicks  or  vvhiie 
noise )  is  generally  smaller  than  in  Aid  ( Schreiner  and  Men¬ 
delson  1990).  because  of  the  relatively  sharply  tuned  single 
neurons.  The  distribution  of  other  response  parameters,  in 
particular  minimum  threshold,  monotonicity  of  rate/ lev  el 
functions,  and  binaurality.  have  to  be  taken  into  account  to 
derive  a  more  complete  interpretation  of  the  implications 
of  the  described  functional  topography  of  Al. 

Relation  to  previous  studies 

The  results  of  this  paper  may  help  to  resolve  discrepan¬ 
cies  among  various  reports  of  CF  topography  and  sharpness 
of  tuning  in  AI  obtained  with  single-  or  multiple-unit  mea¬ 
sures.  A  highly  systematic  CF  topography,  the  cochicoiopic 
organization,  has  been  reliably  found  in  cat  Al  when  apply¬ 
ing  the  multiple-unit  recording  technique  (e.g..  Merzenich 
ctal.  1975;  Reale  and  Imig  1980:  Schreiner  and  Mendelson 
1990).  By  contrast,  several  attempu  to  establish  the  co- 
chleotopic  organization  of  AI  with  single-unit  recordings 
have  indicated  limitations  on  the  precision  of  such  an  orga¬ 
nization  on  the  sin^c-neuron  level  ( Abelcs  and  Goldstein 
1970a.b:  Bogdanski  and  Galambos  I960:  Erulkar  et  al. 
1956;  Evans  and  Whitfield  1964:  Evans  et  al.  1965:  Gold¬ 
stein  etal.  1968. 1970).  Investigators  who  reported  a  lack  of 
CF  consistency  also  report  a  wide  range  of  tuning  sharpness 
( 50%  >0.5  octaves)  and  a  relatively  high  percentage  of  mul- 
tipcaked  neurons.  Other  studies  have  claimed  that  almost 
all  units  in  AI  were  sharply  tuned  (e.g..  Phillips  and  Irvine 
1981).  The  recording  of  predominantly  sharply  tuned  neu¬ 
rons  was  possibly  due  to  recording  only  from  the  ventral 
region  of  Al  and/or  a  reliance  on  Q-  lOdB  as  the  measure  of 
sharpness. 

Some  of  the  inconsistencies  between  multiple-unit  and 
earlier  single-unit  mapping  studies  may  be  accounted  for  by 
differences  in  the  extent  of  the  considered  cortical  area,  spa¬ 
tial  scatter  induced  by  pooling  acros.s  animals,  and  by  the 
state  of  anesthesia  ( Merzenich  et  al.  1975 ).  However,  some 
discrepancies  remain  such  as  the  range  of  CF  scatter  found 
in  sin^e.  near-radial  penetrations  of  AI  ( Abeles  and  Gold¬ 
stein  1970a,b;  Evans  and  Whitfield  1964:  Goldstein  et  al. 
1968;  Phillips  and  Irvine  1981 ).  The  cochlcoiopicity  ob¬ 
tained  with  the  multiple-unit  technique  is  derived  from  the 
minimum  threshold,  that  is  from  the  most  sensitive  neuron 
or  from  the  average  CF  of  the  most  sensitive  neurons  w  iihin 
each  cluster.  For  single-neuron  studies,  an  apparent  lack  of 
CF  topography  or  even  CF  consistency  within  the  same 
penetration  (Evans  and  Whitfield  1964;  Goldstein  ct  al. 
1968.  1970;  Katsuki  et  al.  I959a,b)  could  be  due  to  record- 
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mgs  from  neurons  whose  thresholds  and  CFs  are  different 
from  the  most  sensitive  neuron ( si  near  the  recording  loca¬ 
tion.  The  observations  in  the  current  stud>  indicate  that  the 
precision  in  the  spatial  CF  distribution  of  single  neurons 
changes  gradually  across  Al  from  a  strict  local  alignment  in 
the  central  1-2  mm  of  its  dorsoventral  e.xtcnt  to  a  decreas- 
ingK  strict  alignment  toward  the  dorsal  and  ventral  bound¬ 
aries  of  Al.  This  loss  of  precision  in  single-unit  CF  organiza¬ 
tion  is  present  in  areas  that  still  show  a  high  degree  of  co- 
chleotopic  organization  when  tested  with  multiple-unit 
recordings  and.  therefore,  has  to  be  distinguished  from  the 
scatter  of  multiple-unit  CFs  that  is  encountered  in  the  dor¬ 
sal  portion  of  .All  ( Schreiner  and  Cynader  1984). 

These  findings  suggest  that  the  integrated  multiple-unit 
response  is  reflecting  the  mode  of  the  CF  distribution  at  a 
given  location.  Single-unit  studies  usually  provide  only  a 
single  sample  per  location  that  is  subject  to  the  variance  in 
the  composition  of  the  local  cell  assembly.  Accordingly,  the 
notion  of  a  strictly  defined  isorepresentational  frequency 
domain,  i.e..  the  pure  isofrequency  contour,  can  only  be 
maintained  for  integrative  response  measures  or  for  a 
narrow  1-  to  2-mm-wide  portion  at  the  dorsoventral  center 
of  Al.  The  suggestion  of  a  changing  variance  in  the  CF 
distribution  of  single  neurons  along  the  dorsoventral  axis  of 
Al  consolidates  the  picture  of  the  precision  in  frequency 
organization  as  obtained  with  these  different  recording 
methods.  Additionally,  it  provides  some  of  the  physiologi¬ 
cal  basis  fur  the  observation  of  changing  integrate  band- 
widths  in  the  cortical  dimension  orthogonal  to  the  cochlco- 
topic  gradient. 

Recent  studies  in  primates  and  birds  have  indicated  that 
the  systematic  spatial  distribution  of  integrative  excitatory 
bandwidth,  as  described  earlier  (Schreiner  and  Mendetson 
1 990;  Sutter  and  Schreiner  1991a)  and  in  this  report,  might 
be  a  general  organizational  property  of  the  primary  audi¬ 
tory  telencephalic  field.  The  integrated  excitatory  band¬ 
width  changes  systematically  along  the  isofrequcncy  do¬ 
main  in  Al  of  the  owl  monkey  ( Mcrzenich  et  al.  1 99 1 )  in  a 
similar  fashion  as  seen  in  the  cat.  In  a  combined  multiple- 
and  single-unit  study  of  the  neostriatal  field  L  in  the  chick, 
sharp  tuning  was  found  in  the  approximate  center  of  the 
isofrequency  domain  (Hcil  and  Scheich  1991 ).  As  in  the 
two  mammalian  species,  an  increase  in  breadth  of  tuning 
gradually  occurs  with  recording  sites  funheraway  from  that 
point. 

The  pooling  across  animals  in  these  studies  does  not  al¬ 
low  to  distinguish  unequivocally  whether  the  observed 
global  dorsoventral  gradients  in  excitatory  bandwidth  in¬ 
deed  reflect  a  relatively  smooth  gradient  in  individual  cases 
( due  to  increases  in  scatter  of  CF  or  bandwidth  of  the  un¬ 
derlying  single  neurons)  or  arc  the  consequence  of  a  more 
patchy  or  “columnar”  organization  of  clustered  neurons 
with  similar  bandwidth  charactcnstics.  The  latter  hypothe¬ 
sis  is  suggested  by  the  patchy  distribution  of  several  ana¬ 
tomic  and  functional  aspects  of  Al  organization  ( Imig  and 
Adrian  1977-.  Matsubara  and  Phillips  1988;  Phillips  ct  al. 
1985;  Rajan  ct  al.  1990;  Wallace  et  al.  1991).  Signs  of  a 
certain  functional  patchiness  is  also  apparent  in  individual 
cases  of  the  maps  obtained  in  this  scries  for  spectral  as  well 
as  intensity  parameters  (sec  Schreiner  and  Mendeison 
1990;  Schreiner  et  al.  1992).  However,  the  strictness  of 


such  a  hypothetical  columnar  bandwidth  organization 
would  have  to  be  fairly  limited  because  any  combination  of 
single-peaked  broad,  single-peaked  narrow,  and  multi- 
peaked  broad  neurons  can  be  encountered  in  a  given  pene¬ 
trations  in  the  dorsal  half  of  Al.  The  notions  of  / )  general 
trends  in  the  functional  organization  of  the  dorsoventral 
e.xtcnt  of  A 1  as  expressed,  forexample,  in  the  distribution  of 
e.xcitatory  bandwidth  and  tonotopicity  and  2)  the  occur¬ 
rence  of  cortical  patches  with  similar  functional  attributes 
are  not  necessarily  incompatible.  1‘  may  be  argued  that  gen¬ 
eral  spatial  trends  exist  in  the  organization  of  primary  audi¬ 
tor)'  cortex  and  that  these  trends  are  modulated  by  small, 
functionally  more  coherent  patches.  How  the  physiological 
distinctions  of  neurons  along  the  main  gradients  and  within 
different  cortical  patches  can  be  explained  within  func¬ 
tional.  morphological,  and/or  projcctional  frameworks  of 
cortical  processing  remains  to  be  seen. 

In  conclusion,  this  study  reports  a  physiological  frame¬ 
work  of  the  dorsoventral  extent  of  Al  that  is  based  on  the 
spatial  distribution  of  spectral  parameters,  namely  the  e.xcii- 
atory  bandwidth  (sharpness  of  tuning)  and  the  characteris¬ 
tic  frequency  of  single  as  well  as  local  groups  of  neurons  in 
Al.  The  presented  evidence  is  compatible  with  the  notion 
that  Al  can  be  divided  into  at  least  two  functionally  separa¬ 
ble  regions:  dorsal  Al  (Aid)  and  ventral  Al  (Alv).  Func¬ 
tional  interpretations  of  the  role  of  Al  from  physiological 
properties  of  single  neurons  need  to  take  these  topographi¬ 
cally  based  representational  principles  into  account. 
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Summary.  The  neuronal  response  to  tones  as  a  function 
of  intensity  was  topographically  studied  with  multiple- 
unit  recordings  in  the  primary  auditory  cortex  (AI)  of 
barbiturate-anesthetized  cats.  The  spatial  distribution  of 
the  characteristics  of  rate/levei  functions  was  determined 
in  each  of  three  intensely  studied  cases  and  their  relation¬ 
ship  to  the  distribution  of  spectral  parameters  (sharpness 
of  tuning  and  responses  to  broadband  transients)  in  the 
same  animals  was  determined.  The  growth  of  the  high- 
intensity  portion  of  rate/levcl  functions  was  estimated  by 
linear  regression.  Locations  with  monotonically  growing 
high-intensity  portions  were  spatially  segregated  from 
locations  with  nonmonotonic  rate/'level  functions.  Two 
noncontiguous  areas  with  a  high  degree  of  non- 
monotonicity  were  observed.  One  was  located  at  the 
dorsoventrul  center  of  AI,  and  a  second  in  the  dorsal 
third  of  AI.  The  more  ventral  aggregate  of  high  non- 
monotonicity  coincided  with  the  region  of  sharp  fre¬ 
quency  tuning.  The  stimulus  levels  that  produced  the 
highest  bring  rate  (strongest  response  level,  SRL)  at  any 
sampled  location  ranged  fiom  10  to  80  dB  sound  press¬ 
ure  level  (SPL).  Several  spatial  aggregates  with  either 
high  or  low  SRLs  were  observed  in  AI.  The  region  of 
shatpest  tuning  was  always  associated  with  u  region  of 
low  SRLs.  The  response  threshold  to  contralateral  tones 
at  the  characteristic  frequency  (CF)  ranged  from  — 10  dB 
SPL  to  85  dB  SPL  with  the  majority  between  0  and  40 
dB  SPL.  The  spatial  distribution  of  response  thresholds 
indicated  several  segregated  areas  containing  clusters 
with  cither  higher  or  lower  response  thresholds.  The 
correlation  of  response  threshold  with  integrated  band¬ 
width  and  transient  responses  was  only  weak.  Low-  and 
high-intensity  tones  of  the  same  frequency  arc  represent¬ 
ed  at  different  locations  in  AI  as  judged  by  the  amount 
of  evoked  neuronal  activity  and  arc  largely  independent 
of  the  frequency  organization.  The  spatial  distribution  of 
locations  with  high  monotonicity  and  low  strongest  re¬ 
sponse  levels  were  aligned  with  the  organization  of  the 
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integrated  excitatory  bandwidth  and  cuvaried  with  the 
response  strength  to  broadband  stimuli. 

Key  words:  Primary  auditory  cortex  -  Intensity  -  Isofrc- 
quency  domain  -  Topography  -  Cat 


Introduction 

Studies  of  the  spatial  distribution  of  several  functional 
response  properties  in  mammalian  primary  auditory  cor¬ 
tex  (AI)  have  revealed  systematic  pftysiological  organiza¬ 
tions.  The  most  prominent,  and  best  documented,  or¬ 
ganizational  features  are  a  cochleotopic  arrangement  of 
the  characteristic  frequency  (CF)  of  neurons  (  Merzenich 
et  al.  1975;  Reale  and  Imig  1980;  Woolsey  and  Walzl 
1942)  and  the  spatial  segregation  of  diiTerent  binaural 
integrations  of  excitatory  and  inhibitory  inputs  into  “bi¬ 
naural  interaction  bands”  (Imig  and  Adrian  1977; 
Middlebrooks  ct  al.  1980).  There  is  additional  -  although 
in  some  cases  only  preliminary  -  evidence  that  other 
response  properties  are  also  spatially  segregated  or  sys¬ 
tematically  organized  within  AI.  Among  those  properties 
arc  aspects  of  responses  related  to  sound  localization 
(c.g..  Imig  ct  al.  1990;  Jenkins  and  Merzenich  1984; 
Middlebrooks  and  Pettigrew  1981;  Rajan  ct  al.  1990; 
Reale  and  Kcttncr  1986).  inhibitory  interaction  (  Sham- 
ma  and  Fleshman  1990;  Suga  and  Tsuzuki  1985).  sharp¬ 
ness  of  tuning  (Schreiner  and  Cynader  1984;  Schreiner 
and  Mendelson  1990;  Sutter  and  Schreiner  1991a),  and 
the  growth  of  response  strength  (Phillips  et  al.  1985; 
Rcalc  ct  al.  1979). 

In  an  attempt  to  assess  the  global  distribution  and 
possible  covariances  of  functional  characteristics  within 
Al.  12  properties  of  multiple-unit  responses  along  the 
dorsoventral  extent  of  cat  AI  were  investigated 
(Schreiner  et  al.  1988;  Mendelson  et  al.  1988).  In  an 
initial  report  of  this  experimental  series,  the  spatial  distri¬ 
bution  of  the  “integrated  excitation*'  as  reflected  in  the 
sharpness  of  tuning  curves  and  the  response  strength  to 
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broadband  transients  was  described  (Schreiner  and  Mcn- 
delson  1990).  A  consistent  systematic  distribution  of  the 
spectrally  integrated  excitation  along  the  isofrequency 
domain  was  observed,  providing  further  evidence  for  a 
systematic  internal  organization  of  AI.  (Note  that  the 
term  "isofrequcncy"  is  used  here  to  describe  a  hypotheti¬ 
cal  line  along  the  cortical  dimension  that  is  oriented 
orthogonal  to  the  main  CF  gradient.  Whether  indeed 
there  are  true  isofrequency  contours  across  AI  contain¬ 
ing  neurons  of  exactly  the  same  CF  has  never  been 
shown.) 

The  current  report  describes  the  spatial  distribution  of 
neural  response  properties  relating  to  the  representation 
of  pure  tones  as  a  function  of  intensity  for  the  same 
cortical  locations  that  were  studied  regarding  excitatory 
bandwidth  and  responses  to  transients  (Schreiner  and 
Mendelson  1990).  Several  aspects  of  the  changes  in  firing 
rate  with  stimulus  level  as  they  arc  reflected  in  ratc/level 
functions  of  multiple-unit  responses  were  utilized  in  fur¬ 
ther  exploring  the  spatial  organization  of  AI  in  the  anes¬ 
thetized  cat.  The  studied  parameters  are  response  thresh¬ 
old.  dynamic  range,  monotonicity,  and  strongest  re¬ 
sponse  level  (SRL).  In  particular,  the  relationship  of  the 
spatial  distribution  of  intensity  parameters  with  that  of 
the  excitatory  bandwidth  and  responses  to  broadband 
stimuli  (Schreiner  and  Mendelson  1990)  will  be  dis¬ 
cussed. 


Materials  and  methods 

Surgical  preparation 

The  methods  are  the  same  as  described  in  previous,  reia'ed  reports 
(Schreiner  and  Mendelson  1990;  Sutter  and  Schreiner  1991a). 
Briefly,  results  were  obtained  in  the  right  hemispheres  of  aduit 
cats.  Anesthesia  was  induced  with  an  intramuscular  injection  of 
ketamine  hydrochloride  (10  mg/kg)  and  acetylpromazine  maieate 
(0. 10  mg/kg).  Animals  were  maintained  at  a  surgical  level  of  anes¬ 
thesia  with  a  continuous  infusion  of  pentobarbital  sodium  (2  mg/kg 
per  hour)  in  iactated  Ringer's  solution.  The  temperature  of  the 
animals  was  monitored  with  a  rectal  probe  and  maintained  at 
37.5’’  C  by  means  of  a  heated  water  blanket  with  feedback  control. 

The  head  was  fixed  and  the  temporal  muscle  on  the  right  hemi¬ 
sphere  was  then  retracted  and  the  lateral  cortex  exposed  by  a 
craniotomy.  The  dura  overlying  the  middle  ectosylvian  gyrus  was 
removed,  the  exposed  cortex  covered  with  silicone  oil.  and  a  photo¬ 
graph  of  the  surface  vasculature  taken  to  record  the  electrode 
penetration  sites. 


Stimulus  generation  and  deliiery 

Experiments  were  conducted  in  a  sound-shielded  room  (industrial 
Acoustics  Company).  Auditory  stimuli  were  presented  via  cali¬ 
brated  headphones  (STAX  54)  enclosed  in  small  chambers  that 
were  connected  to  sound  delivery  lubes  sealed  into  the  acoustic 
meatuses  (Sokolich.  US  Patent  4251686;  1981).  The  frequency 
response  of  the  system  was  essentially  flat  up  to  12  kHz  and  did  not 
have  major  resonances  deviating  more  than  6  dB  from  the  average 
level  Above  15  kHz.  the  output  rolled  off  at  a  rale  of  10  dB. octave. 

Tones  were  generated  by  a  microprocessor  (TMS320iO;  16-bit 
digital-analog  (D' A)  converter  at  120  kHz;  low-pass  filter  at  35  kHz 
or  50  kHz).  Additional  attenuation  was  provided  by  a  pair  of 
passive  attenuators  (Hewlett-Packard).  The  duration  of  the  lone 


bursts  was  50  ms  including  3  ms  rise  fail  time.  The  intcrstimulus 
interval  was  500-1000  ms. 

For  each  recording  site,  responses  were  recorded  to  at  least  6”5 
diflTereni  tone  bursts.  Tone  bursts  were  presented  in  a  pseudoran¬ 
dom  sequence  of  different  frequency  level  combinations  selected 
from  1 5  level  values  (70  dB  range)  and  45  frequency  values  lequidis- 
lant  on  a  logarithmic  frequency  scale;  2-4  octaves  range).  From  the 
responses  to  all  stimuli,  frequency  response  areas  (FR.As)  were 
reconstructed. 


Recording  procedure 

Parylene-insulatcd  tungsten  microelectrodes  with  impedances  at 
I  kHz  of  0.8-1. 3  Mf)  were  used.  All  penetrations  were  roughly 
orthogonal  to  the  brain  surface.  The  recordings  were  derived  at  an 
intracortical  depth  ranging  from  6(X)  to  1000  pm.  as  determined  b> 
the  microdrive  setting,  roughly  corresponding  to  cortical  layers  III 
and  IV.  Neuronal  activity  of  single  units  or  small  groups  of  neurons 
(2-6  units)  were  amplified,  band-pass  filtered,  and  monitored  on  an 
oscilloscope  and  an  audio  monitor.  Spike  activity  was  separated 
from  the  background  noise  with  a  level  discriminator  (BAK  DIS-I ) 
set  at  more  than  50%  of  the  background  noise  and  stored  in  a 
computer  (DEC  11/73).  The  recording  window  had  a  duration  of 
50  rns.  corresponding  to  the  stimulus  duration. 


Data  analysis 

From  the  FRAs,  response  strengths  to  the  CF  and  the  two  stimulus 
frequencies  closest  to  the  CF  were  extracted.  .Spike  count  level 
profiles  were  constructed  for  each  recording  site  by  summing  the 
spike  counts  for  each  signal  level  produced  by  these  frequencies,  i.e.. 
responses  from  45  signal  presentations  (15  levels  for  three  fre¬ 
quencies)  were  utilized.  The  following  response  parameters  were 
obtained: 

A.  Contralateral  threshold,  the  lowest  level  of  a  contralateral  tone 
(expressed  in  dB  !^PL)  that  reliably  evoked  a  response.  This  re¬ 
sponse  parameter  was  directly  determined  from  audiovisual  cues  of 
the  raw  response  u'jLing  stimulation  with  contralateral  tones  while 
manually  controlling  stimulus  frequency  and  level.  This  measure 
was  used  for  the  representation  of  the  response  threshold. 

From  the  rate/level  functiotis,  the  following  response  parameters 
were  extracted; 

B.  Transition  point,  the  point  in  the  rate.level  function  that  marked 
the  transition  from  a  fast-growing,  low-level  poniori  to  a  less  fast 
growing,  saturating  or  decreasing  high-level  portion  of  the  rate  level 
function.  In  virtually  all  cases,  a  monoionic.  fast-growing  portion 
could  be  distinguished  that  rose  from  the  response  threshold  (sec 
Fig.  1.  open  arrows)  to  a  iranstlicn  point  (see  Fig.  i.  filled  ariow). 
i.e..  a  clear  transition  to  a  less  fast  growing  portion  of  the  rate  level 
profile,  a  saturation,  or  to  a  decline  of  firing  rate. 

C.  Oynamk  range,  the  ntonotonic.  fsst-growing.  low-level  portion 
of  a  rate.'level  function.  The  dynamic  range  was  rounded  to  the 
nearest  multiple  of  5  dB. 

D.  SRL.  the  stimulus  level  (in  dB  SPL)  evoking  the  highest  firing 
rate  tn  the  range  covered  by  the  rate/lcvci  profile.  If  the  rate  level 
function  was  monoiontcally  growing  up  to  the  highest  level  used  for 
the  reconstruction,  the  highest  level  was  used  as  an  estimate. 

E.  Monotonicity,  the  rate  of  change  in  the  rate.level  profile  for 
stimulus  levels  above  the  transition  point  of  the  function.  To  quan¬ 
titatively  detennine  the  degree  of  rate  change  or  the  amount  of 
monoioniciiy  above  the  transition  point,  the  high-level  slope  of  the 
rate;  level  profile  was  estimated  by  a  linear  regression  analysis.  The 
starting  point  of  the  regression  analysis  was  at  the  transition  point 
of  the  rate.'lcvel  profile  (Fig.  1.  filled  arrow).  The  end  point  was 
either  the  rate  at  the  highest  stimulus  level  used  to  construct  the 
profile  or  the  stimulus  level  at  which  the  firing  rate  fell  below  90% 
of  the  firing  rate  of  the  transition  point  minus  spontaneous  rate. 
A  minimum  of  five  level  values,  including  the  transition  point  itself. 
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#706/28  DR-20  dB 

CF-11.4kH2  SRL-40  dB 
Thr.-20dB  Mon— 2.14 


#706/70  DR-10  dB 

CF-14.3kHz  SRL-IOdB 
Thr.-OdB  Mon— 1.14 


#706/42  DR-15  dB 

CF-10.5  kHz  SRL-20dB 
Thr.-5  dB  Mon-0.0 


#706/30  DR-15dB 
CFu  10.5  kHz  SRL-70dB 
Thr.-ISdB  Mon-t-0.96 


Lov«l  (dB  SPL)  L«v«l  (dB  SPL) 


Fig.  1.  Spike  count  level  functions  of  primary  auditory  cortex  (All. 
Four  representative  examples  of  spike  count  level  functions  are 
shown.  All  were  obtained  in  the  same  animal  (87-706).  Response 
thresholds  are  indicated  by  optn  arrows.  The  transition  point  (see 
Materials  and  methods)  is  marked  by  a  filled  arroK.  The  strongest 
response  level  (SRL)  of  each  location,  i.e..  the  stimulus  level  that 
prc^uced  the  highest  firing  rate,  is  indicated  by  a  star.  The  dashed 
line  in  the  high-level  portion  of  the  function  is  the  linear  regression 


had  to  be  included  in  the  analysis,  or  the  cortical  location  was 
excluded  from  further  consideration  of  this  parameter.  The  slope  of 
the  resulting  regression  function  (percentage  decibels:  Fig.  1. 
dashed  lines]  was  taken  as  measure  of  the  monotonicity  of  each 
recording  site. 


Data  representation 

Pseudo  three-dimensional  projections  and  contour  plots  were  uti¬ 
lized  to  represent  the  spatial  distribution  of  response  parameters 
across  the  cortical  surface  (topography  software  from  Golden  Soft¬ 
ware:  see  Schreiner  and  Mendelson  1990).  The  actual  spatial  loca¬ 
tions  of  the  recording  sites  were  used  to  generate  a  two-dimensional 
grid  of  the  represented  area  by  projecting  the  actual  sites  to  the 
nearest  grid  point.  Elevation  of  the  grid  surface  corresponded  to  the 
spuiialiy  averaged  magnitude  of  a  functional  parameter  at  a  given 
site.  The  grid  program  employed  a  weighted  inverse  distance 
squared  nlgonthm  for  (he  interpolation  of  all  grid  points.  The  result 
of  the  tiansformaiions  provided  a  faithful  although  slightly  am¬ 
plitude  compressed  picture  of  the  spatial  distribution  of  the  original 
information  (see  Fig.  i  us  Schreiner  and  Mendelson  1990). 

5mre  the  pseudo  three-dimensional  projection  of  (he  maps  al¬ 
ways  resulted  in  some  peispective  distortions,  an  undistorted  view 
of  the  spatial  distribution  is  provided  by  presenting  detailed  contour 
plois  (line  segments  that  connect  interpolated  locations  with  equal 
surface  elevaiion  or  z-values)  of  one  of  the  three  cases.  To  judge  the 
correspondence  of  the  data  values  with  their  three-dimensional 
representation,  the  value  and  location  of  each  data  point  is  provided 
for  one  exemplary  case.  The  distribution  of  recording  locations  for 
ihe  other  two  maps  as  well  as  the  corresponding  distributions  of 
churactenstic  frequency,  sharpness  of  tuning,  and  transient  re¬ 
sponses  can  be  obtained  from  the  maps  shown  in  the  first  pan  of 
this  study  (Schreiner  and  Mendelson  1990). 


Results 

Response  threshold 

Response  thresholds  to  contralateral  stimulation  with 
CF  tones  were  obtained  foi  261  locations  in  the  high- 


cstimuic  of  (he  spike  count  as  a  function  of  level.  The  slope  or 
monotonicity  i  .Vioni  of  the  function  is  indicated  tihnt  f  nn/i  i.vuni- 
p/**  expressed  In  percentage  per  decibels  l%  dB:  lOOIS  corresponds 
to  the  firing  rate  at  the  transition  point).  Chaructciisiic  frequency 
(Cfl.  threshold  (T/ir..-  in  clB  SPL).  dypumic  range  t/ffil.  level  runge 
between  threshold  and  transition  point),  and  SRL  idB sound  press¬ 
ure  level  SPL)  are  also  given  for  each  example 


frequency  domain  of  AI  for  CFs  between  5.8  and 
26.3  kHz.  AI  was  identified  by  its  rostrocaudal  CF  gra¬ 
dient.  The  dorsoventral  extent  of  the  mapping  was  cur¬ 
ried  out  until  clear  deviations  from  the  cochleotopic 
organization  were  encountered,  which  were  presumed  to 
indicate  that  the  end  of  AI  was  reached  (Middlebrooks 
and  Zook  1983;  Schreiner  and  Cynadcr  1984)  The  loca¬ 
tions  of  the  mapped  areas  on  the  right  hemisphere  of  cat 
cortex  are  schematically  shown  in  Fig.  2  (lightly  shaded 
regions).  The  orientation  of  the  isofrcqucncy  domain 
(thin  line)  is  indicated  by  an  estimated  isofrequency  con¬ 
tour  (for  details  see  Schreiner  and  Mendelson  1990. 
Fig.  4).  The  function?*!  border  to  the  second  auditory 
field  (All;  Fig.  2.  dashed  line)  was  estimated  from  the 
local  distribution  of  CFs.  the  sharpness  of  tuning,  and 
the  response  threshold  (Schreiner  and  Cynadcr  1984) 

The  distributions  of  the  encountered  response  thresh¬ 
olds  in  the  studied  cortices  is  shown  as  a  function  of  CF 
in  Fig.  3.  For  any  given  CF.  a  range  of  response  thresh¬ 
olds  was  obscp;ed  that  extended  over  20  dB  to  more  than 
50  dB.  Mean  and  standard  deviation  of  the  threshold 
values  arc  given  in  Table  1  for  each  case  separately.  For 
each  animal,  the  lowest  response  thresholds  were  be¬ 
tween  -  10  and  10  dB  SPL,  indicating  a  similar  degree 
of  sensitivity. 

Reconstructions  of  the  spatial  distribution  of  response 
thresholds  clearly  indicated  that  the  obtained  range  of 
thresholds  was  not  randomly  distributed  across  the 
mapped  area.  Figure  4A  illustrates  the  three-dimensional 
reconstruction  of  case  87-518,  showing  high  response 
thresholds  in  the  ventral  portion  of  the  mapped  area, 
systematically  decreasing  thresholds  toward  a  region 
with  predominantly  low  thresholds  in  the  middle  of  the 
map,  and  a  moderate  increase  in  response  threshold 
toward  the  dorsal  third  of  the  map.  This  latter  area  of  the 
map  showed  some  regional  variability  in  the  threshold 
values.  The  relationship  between  the  orientation  of  iso- 
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Hi{.  2.  Si'hemauc  rcprcscniaiion  of  ihc  mapped  areas  on  the  right  hemisphere  of  cat  cortex.  The  ligliilv  shaded  wunn/lc  approximates  ihc 
mapped  area  (see  Fig.  4  m  Schreiner  and  Mondcison  1990  for  more  details).  The  orientation  of  the  isofrequency  axis  is  indicated  by  the 
course  of  one  representative  isofrcquency  contour  iihin  llnei.  The  iviiiwurv  correspond  to  18  kHz  I  *87-001 ).  10  kHz  i  #87-518).  and  1 1  kHz 
(  #  8V-706).  respectively.  The  dashed  hues  indicate  the  estimated  border  between  primary  auditory  cortex  ( Al )  -  located  dorsal  to  the  border 
-  and  secondary  auditory  cortex  (All).  SS5.  suprasylvian  sulcus.  .d£'5>.  anterior  cctosvivian  sulcus:  PES.  postenor  cciosylviun  sulcus 
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Fig.  3A-C.  Contralateral  response  thresholds.  All  encountered 
threshold  values  are  plotted  as  a  function  of  charactenstic  fre¬ 
quency  iCF)  for  each  of  the  three  cases.  Mean  thresholds  with 
standard  deviations  arc  given  in  Table  I 


frequency  contours  (see  thin  dashed  line  in  the  contour 
plot  of  Fig.  4A)  and  the  overall  pattern  in  the  threshold 
di>;tribution  might  suggest  that  response  thresholds  vary 
s] stcmutically  along  the  isofrequency  domain.  In  this 
case,  at  least  the  middle  and  ventral  portion  of  the  map 
suggest  that  the  global  threshold  distribution  pattern 
may  be  independent  of  the  cochleotopic  gradient.  The 
area  of  lowest  thresholds  includes  the  locution  of  the 
reversal  in  the  dorsoventrul  gradient  of  the  integrated 
excitatory  bandwidth  (thick  dashed  line:  mean  from  Q- 
10  dB  and  Q-40  dB;  Schreiner  and  Mendclson  1990) 
which,  correspondingly,  represents  the  most  sharply 
tuned  area  along  the  isofrcquency  axis  of  AI. 

Aggregates  of  locations  With  higher  or  lower  response 
thresholds  were  also  seen  in  the  two  other  cases  as  dem¬ 
onstrated  in  Fig.  4B.  C.  Doth  maps,  however,  showed  a 
more  complex  distribution  of  high-  and  low-threshold 
aggregates.  In  particular,  the  spatial  frequency  (spatial 
rate  change  t>etween  maxima  and  minima)  of  the 
threshold  distribution  of  the  two  cases  appeared  to  be 
higher  than  in  case  87-518.  High-  and  low-threshold 
aggregates  were  scattered  throughout  the  entire  extent  of 
the  mapped  areas  and  showed  no  clear  spatial  alignment 
with  the  tonotopic  gradient  or  the  isofrequency  axis.  The 
shape  of  areas  with  similar  thresholds  varied  widely. 
Finally,  the  functional/spatial  transitions  from,  for  ex¬ 
ample.  a  low-threshold  to  a  high-threshold  aggregate 
varied  and  could  be  either  gradual  or  quite  steep. 

In  summary,  a  large  range  of  threshold  values  was 
found  even  for  responses  with  similar  CF.  Spatial  ag¬ 
gregates  could  be  delineated  in  AI  that  covered  the  same, 
relatively  narrow  ranges  of  response  thresholds.  The 
spatial  extent  and  location  of  aggregates  with  similar 
response  thresholds  appeared  to  vary  greatly  from  ani¬ 
mal  to  animal,  and  varied  with  regard  to  their  locations 
in  the  isofrequency  domain  of  AI. 


Dynamic  range 

Examples  of  the  dynamic  range  of  rate  level  functions, 
as  dehned  above  (see  Materials  and  methods),  are 
presented  in  Fig.  1  as  the  intensity  range  between  the 
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Table  I.  Siaiisiical  doscnption  ol'  response  threshold,  dynamic  range,  strongest  response  level,  and  ntonotonicity  for  cortical  rate  letel 
lunclions 


Case 

87-518 

87-706 

.Mean  ±  SD 

Conical 
locations  i/i) 

McanxSD 

Cortical 
locations  (n) 

MeanxSD 

Cortical 
locations  l/il 

Threshold  (dB  SPLl 

16.3  zlO.6 

95 

212  1 12.2 

SI 

16.9  X  10.9 

85 

Dvnamic  range  (dBl 

20.8  X  6.5 

86 

19.9  X  8.4 

68 

16.6  X  5.9 

SO 

SRL  idB  SPL) 

35.8  xll.O 

86 

49.4  xl7.l 

77 

39.4  X  16.9 

82 

Monotoniciiv  (?ii  dB) 

“O.Mj:  0.76 

86 

-0.60x  0.90 

67 

-0.30±  0.57 

80 

SKL.  strongest  response  level 


Fig.  4A-C.  Spatial  distribution  of  the  contralateral  response  thresh¬ 
old  in  primary  auditory  cortex  (At)  expressed  in  dB  SPL.  A  The 
upper  portion  depicts  u  pseudo  three-dimensional  projection  ol'  the 
spatial  distribution  of  response  thresholds  In  A1  (case  87-518).  Two 
dimensions  of  the  projection  represent  the  dorsal  -  ventral  (</.  c)  and 
posterior  -  anterior  Ip.  u)  extent  of  the  coitical  surface.  The  third 
dimension,  elevation  of  the  surface,  corresponds  to  the  magnitude 
of  the  functional  parameter  “threshold"  across  the  mapped  cortex. 
An  additional  smoothing  factor  of  0.9  was  applied  to  the  data.  Due 
10  perspective  distortion  in  the  projection,  the  acaU  bar  of  the 
threshold  axis  is  only  accurate  for  the  nearest  comer  of  the  plot.  The 
luwer  portion  of  the  panel  shows  an  undistorted  contour-plot  repre¬ 
sentation  cf  the  same  data.  Each  line  connects  points  of  equal 
threshold,  \umbers  next  to  some  lines  reflect  the  appropriate  values 
of  isuthrcshold  contours.  Contour  intervals  correspond  to  lO-dB 
steps.  The  onentution  of  the  isofrequcncy  domain  of  the  mupiscd 
cortex  IS  indicated  by  the  thin  dashed  line  ( 10  kHz).  The  thick  dashed 
line  indicaies  the  location  of  the  sharpest  frequency  tuning  obtained 
as  the  mean  of  the  highest  Q-IO  dB  and  Q-^dB  values  (Schreiner 
and  Mendelson  1990).  Aid.  dorsal  Al:  Ale.  ventral  Al.  B  Spatial 
dislribuiion  of  threshold  values  iti  Al  (87-706)  represented  us 
pseudo  thrcc-diincnsional  projection.  C  Spatial  distribution  of 
threshold  values  in  Al  (87-001 )  represented  us  pseudo  ihree-dimen- 
sionul  projection.  The  actual  threshold  values  (expressed  in  dU  SPL) 
that  underlie  the  spatial  rccoiisiruction  are  shown  in  the  Inner  panel 
plotted  Ut  then  approximate  cortical  location 
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Fig.  5A-C.  Dynamic  range  of  rale  level  functions.  All  encountered 
dynamic  range  values  are  plotted  us  a  function  of  characterisiic 
frequency  ( CF)  for  each  of  the  three  cases.  Mean  values  with  stan¬ 
dard  deviations  are  given  in  Table  I 


open  and  closed  arrows.  At  all  recording  sites  for  which 
a  dynamic  range  could  be  defined,  values  were  between 
5  and  40  dB.  Mean  and  standard  deviations  of  the  en¬ 
countered  dynamic  ranges  arc  given  in  Table  1.  The 
spread  of  values  was  essentially  independeni  of  the  CF 
(see  Fig.  5). 

The  spatial  distribution  of  the  dynamic  range  values, 
as  shown  in  Fig.  6,  appeared  to  be  quite  idiosyncratic. 
Spatial  aggregates  of  several  recording  locations  with 
similar  dynamic  ranges  were  identified.  However,  the 
representation  of  similar  dynamic  ranges  was  quite  frag¬ 
mented.  leading  to  steep  gradients  between  aggregates 
with  large  or  small  values.  Consequently,  a  relatively 
high  spatial  frequency  in  the  distribution  of  the  function¬ 
al  parameter  range  was  observed.  This  attribute  of  rate 
level  functions  did  not  show  a  clear  systematic  spatial 
disiribuiion  that  appeared  to  be  aligned  with  cither  the 
CF  gradient  or  the  isofrequcncy  axis  and  differed  sub¬ 
stantially  from  case  to  case. 


Xfonotonicity  of  rate  level  functions 


The  distribution  of  monotonicity  values,  i.e..  the  ap- 
pro.ximatcd  slope  of  rate  level  functions  above  the  tran¬ 
sition  point  (sec  Materials  and  methods),  arc  shown  in 
Fig.  7.  A  quantitative  evaluation  of  the  monotomcitv  or 
nonmonotonicity  of  the  obtained  rate  level  functions 
revealed  that  60.5%  of  the  233  locations  for  which  the 
analysis  could  be  completed  were  nonmonotonic  (slopes 
<0%  dB).  Strongly  nonmonotonic  rate  level  func¬ 
tions.  arbitrarily  defined  as  having  slopes  of  less  than 


-  l%  dB.  were  obtained  for  19,3%  of  all  recorded  loca¬ 
tions  or  31.9%  of  all  rate  level  functions  with  negative 
slopes. 

The  spatial  distribution  of  the  degree  of  monotoniciiy 
is  shown  in  Fig.  8,  The  elevation  of  the  grid  surfaces 
corresponds  to  negative  slope  values,  and  therefore  ex¬ 
presses  the  degree  of  /lo/imonotonicity  of  rate  level  func¬ 
tions;  increasing  elevation  represents  increased  values  of 
nonmonotonicity.  In  cases  87-518  (Fig,  8A)  and  87-706 
(Fig.  8B),  two  prominent,  circumscribed  regions  with 
medium  to  high  degrees  of  nonmonotonicity  cun  be  deli¬ 
neated.  The  ventral  urea  of  nonmonotonicity  appeared 
in  both  cases  to  be  spatially  more  extended  than  the 
dorsal  region.  Both  regions  were  separated  by  a  relatively 
narrow  region  containing  locations  with  positive  or  near- 
zero  slope  values.  The  degree  of  monotonicity  varied 
substantially  and  systematically  in  the  isofrequency  do¬ 
main.  whereas  the  variations  in  the  spatial  dimension  of 
the  CF  gradient,  i.e.,  orthogonal  to  the  isofrequcncy 
domain,  appeared  to  be  smaller  and  lacked  a  clear  syste¬ 
matic  organization. 

The  third  case.  87-(X)l,  showed  a  more  complex  spa¬ 
tial  distribution  of  the  monotonicity  of  rate  level  func¬ 
tions  that  appeared  to  dilTer  somewhat  from  the  other 
two  cases.  However,  the  isofrequcncy  contour  was  not 
aligned  with  the  dorsuvcntral  extent  of  the  mapped  area 
but  approximated  the  posterior-ventral  anterior-dorsal 
diagonal  of  the  map  (Schreiner  and  Mendelson  1990:  sec 
also  Figs.  2.  11).  Consequently,  similar  interpretations  of 
the  spatial  distribution  can  be  derived  us  outlined  for  the 
other  two  cases,  i.e.,  an  orderly  progression  in  the  isofre- 
quency  domain  and  orthogonality  to  the  frequency  gra¬ 
dient.  The  only  remaining  difiference  between  this  and  the 
other  two  cases  is  that  no  clearly  expressed  dorsal  ag¬ 
gregate  of  nonmonotonicity  was  observed,  although 
several  suiauer  patches  in  the  dorsal  part  cl  A1  appear 
to  be  highly  nonmonotonic.  All  three  cases  showed  that 
the  ventral  region  of  high  nonmonotonicity  was  closely 
aligned  with  the  region  of  sharpest  frequency  tuning  (sec 
thick  dashed  lines  in  Figs.  8A  and  1 1 ). 


Strongest  response  level 

The  stimulus  level  prtxlucing  the  highest  firing  rate  for 
a  CF  lone  (SRL).  varied  between  10  and  80  dB  SPL.  as 
shown  in  Fig.  9.  For  each  CF.  essentially  the  full  range 
of  SRLs  was  encountered,  showing  a  fairly  even  distribu¬ 
tion  across  the  level  range.  Mean  and  standard  devia¬ 
tions  are  given  in  Table  1. 


Ill 


Fig.  6A-C.  Spatial  disiribuiion  of  the  dynamic  range  in  pnmary 
auditory  corte.x  (Al)  expressed  in  decibels,  A  The  upper  portion 
depicts  a  pseudo  three'diinensional  projection  of  the  spatial  distri¬ 
bution  of  dynamic  ranges  m  Al  lease  87-SI8).  The  lower  portion  of 
the  panel  shows  an  undistorted  contour-plot  represeniuiion  of  the 
same  data.  The  contour  intervals  are  5  dB.  (Sec  Fig.  4  for  further 
details.)  B  Spatial  distribution  of  dynamic  ranges  for  case  87-706. 
C  Spatial  distribution  of  dynamic  ranges  for  case  87-001.  Thc/ower 
portion  of  the  panel  shows  the  obtained  dynamic  range  values  at 
their  approximate  cortical  location 


The  spatial  distribution  of  the  SRL  values  is  shown  in 
Fig.  10.  in  all  three  cases,  spatial  aggregates  of  recording 
locations  with  the  lowest  SRLs  were  encountered  in  the 
dorsoventral  center  of  the  mapped  areas.  Incase  87-518. 
the  SRLs  systematically  increased  toward  the  ventral  and 
dorsal  boundaries  of  Al.  For  the  other  two  cases,  spatial 
aggregates  of  locations  with  medium  and  high  SRLs  were 
evident;  however,  each  spatial  distribution  differed  sub¬ 
stantially  from  the  other  cases  and  showed  no  clear 
pattern  that  appeared  to  be  aligned  with  cither  the  CF 
gradient  or  the  isofrcqucncy  axis.  Note  that  some  of 
the  lowest  SRLs  were  often  found  near  the  region  with 
sharpest  frequency  tuning  (see  thick  dashed  line  in 
Figs.  lOA.  1 1). 


Covariance  of  ratsjlevel  function  and  spectral 
characieristics 

The  parameters  that  were  extracted  from  the  rate  level 
function  to  describe  the  influence  of  stimulus  level  on  the 
response  characteristic  of  cortical  locations  were  not 
necessarily  independent  of  each  other.  The  SRL  of  a 
given  cortical  location,  for  example,  is  in  essence  propor¬ 
tional  to  the  response  threshold,  the  dynamic  range,  and 
the  monotonicity  of  the  ratc/lcvcl  function.  A  correla¬ 
tional  analysis  of  the  studied  parameters  reflects  these 
interdependencies.  Tables  2-5  show  all  parameter  com¬ 
binations  that  were  correlated  at  a  level  of  signiflcance  of 
P<0.05.  In  addition,  the  correlations  of  these  intensity 
parameters  with  those  of  the  spectral  parameters  inte¬ 
grated  eiicitatory  bandwidth  and  transient  response  are 
given. 


ii: 


CF  (kHz) 
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Fig.  7A-C.  The  monoionicity  measure  of  the  high-level  portion  of 
rate  level  functions  (see  Materials  and  methods).  All  encountered 
monotonicity  values  are  plotted  as  a  function  of  characteristic 
frequency  ( CD  for  each  of  the  three  cases.  The  dashtd  line  separates 
locations  with  monotonic  rateTevel  functions  from  locations  with 
nonmonotonic  rate,  level  functions.  Mean  values  with  standard 
deviations  are  given  in  Table  I 


Response  thresholds  (Tabic  2)  showed  the  highest 
and.  across  cases,  most  reliable  correlation  with  the  SRL. 
In  two  of  the  three  cases  this  correlation  was  most  strong¬ 
ly  expressed  in  the  ventral  aspects  of  AI.  A  weak  but 
fairly  consistent  negative  correlation  was  evident  with  the 
excitatory  bandwidth,  especially  with  Q-40  dB  in  AIv. 
i.e..  the  higher  the  threshold,  the  broader  was  the  fre¬ 
quency  tuning.  Particularly  in  case  87-(X)l.  the  strength 
of  the  step  response  was  correlated  with  the  response 
threshold. 

Monotonicity  values  were  consistently  correlated  with 
the  SRLs  across  the  entire  extent  of  Al  (Table  3).  Equally 
reliable  correlations  were  seen  between  the  monotonicity 
and  Q-40  dB  across  Al.  but  especially  in  AIv.  In  addi¬ 
tion.  monotonicity  was  positively  correlated  with  the 


strength  of  step  responses  across  all  of  .AI.  with  the 
exception  of  the  dorsal  aspects  of  case  87-706. 

In  addition  to  correlations  with  response  threshold 
and  monotonicity.  the  SRLs  showed  strongest  correla¬ 
tions  with  the  step  responses,  particularly  in  .Al\ 
(Table  4).  Correlation  between  other  parameters  were 
either  weak  or  not  consistent  across  cases. 


Spuiial  alignment  of  intensity  unci  spectral  parameters 

To  compare  the  spatial  distribution  of  the  studied  re¬ 
sponse  characteristics  with  the  location  of  the  area  of 
sharpest  tuning  in  the  same  cases  (Schreiner  and  Mendel- 
son  1990).  contour  plots  of  the  three  cases  are  given  in 
Fig.  11.  Shaded  areas  in  these  contour  plots  represent 
approximately  the  spatial  locations  of  the  lower  half  of 
the  encountered  parameter  range  for  response  threshold, 
dynamic  range,  and  SRL  (see  Table  1 ).  For  the  represen¬ 
tation  of  monotonicity  values  a  different  criterion  was 
chosen,  namely  that  of  moderate  to  high  nonrnonotonic- 
ity.  Accordingly,  shaded  areas  correspond  to  values  be¬ 
low  -0.5%-dB  for  cases  87-001  and  87-706.  and  below 
-0.49b  dB  for  case  87-518. 

Comparison  of  the  spatial  distributions  of  level- 
related  response  parameters  within  and  across  animals 
suggests  a  liigh  degree  of  idiosyncrasy  in  these  aspects  of 
functional  organization  of  auditory  cortex.  The  spatial 
pattern  of  monotonicity  appeared  to  be  the  most  consis¬ 
tently  organized  of  the  four  parameters  described  here. 
A  cirr’imscribcd  area  of  moderate  to  high  non¬ 
monotonicity  was  reliably  encountered  in  the  center  of 
high-frequency  AI  along  the  main  CF  gradient  and  coin¬ 
cided  with  the  area  of  narrowest  integrated  excitatory 
bandwidth.  These  elongated  patches  of  high  non¬ 
monotonicity  were  arranged  orthogonal  to  the  isofre¬ 
quency  domain  and.  although  they  showed  interrup¬ 
tions.  appeared  to  cover  the  entire  studied  frequency 
range.  Smaller,  patchy  aggregates  of  nonmonotonic  rate 
level  functions  were  consistently  encountered  dorsal  to 
the  main  area  of  nonmonoionicity.  The  spatial  distribu¬ 
tion  of  the  SRL  measure  showed  a  similar,  although 
slightly  more  complex,  pattern.  Since  monotonicity  and 
SRL  were  the  most  strongly  correlated  parameters,  this 
spatial  covariance  was  to  be  expected. 

For  all  three  cases,  the  response  threshold  values  were 
moderately  correlated  with  SRL  values  (see  Table  2).  The 
spatial  distribution  of  response  thresholds  appeared  to  be 
more  erratic  and  showed  some  deviation  from  the  general 
pattern  of  alignment  with  the  CF  gradient  and  the  sharp¬ 
ness  of  tuning  gradient  as  seen  for  monotonicity  and.  to 
a  lesser  degree,  SRL. 

Finally,  the  spatial  pattern  derived  for  the  dynamic 
range  of  rate/level  functions  showed  the  least  resem¬ 
blance  to  the  spatial  distributions  of  the  other  parameters 
under  consideration,  thus  reflecting  the  virtual  absence 
of  correlation  with  those  parameters  (Table  5). 


-1 


Fig.  8A-C.  Spatial  distribution  of  the  degree  of  monotonicity  en¬ 
countered  in  primary  auditory  cortex  (Al)  expressed  in  percentage 
per  dccibeis.  A  The  upper  poriion  dcp'cts  a  pseudo  three-dimension¬ 
al  projection  of  the  spatial  distribution  of  nonmonotonicity  in  AI 
(case  87-518).  Note  that  negative  values,  i.e..  the  degree  of  non- 
monotonicity,  are  proportional  to  the  elevation  of  the  grid  surface. 
The  lower  portion  of  the  panel  shows  an  undisiorted  contour-plot 
representation  of  the  same  data.  The  contour  intervals  are  0.4%  dB. 
(See  Fig.  4  for  further  detatls.)  B  Spatial  distribution  of  dynamic 
ranges  for  case  87-706.  C  Spatial  distribution  of  dynamic  ranges  for 
case  87-001.  The  lower  portion  of  the  panel  shows  the  obtained 
dynamic  range  values  at  their  approximate  cortical  location 


Table  2.  Correlation  of  contralateral  mini- 
muni  threshold  with  cortical  response 
measures  obtained  at  the  same  locations 


Case 

AI 

Aid* 

Alv* 

r 

P 

r 

P 

r 

P 

MON 

87-518 

0.32 

0.01 

0.56 

0.0001 

SRL 

87-516 

0.62 

O.OOOl 

0.78 

0.0001 

87-706 

0.35 

0.001 

0.33 

0.04 

87-001 

0.48 

0.0001 

0.55 

0.0001 

DR 

87-001 

-0.49 

0.006 

Q-10  dB 

87-001 

-0.29 

0.004 

-0.28 

0.03 

dB 

87-518 

-0.26 

0.02 

-0.34 

0.05 

87-00 i 

-0.25 

0.02 

-0.48 

0.003 

Step 

87-516 

0.35 

0.04 

87-001 

0.36 

0.0004 

0.26 

0.05 

0.41 

0.02 

Linear  regression  analysis;  r —  correlation  coePTicient;  Pm  level  of  significance  (F-test) 

'  Aid  represents  recording  location  in  the  portion  of  Al  dorsal  to  the  most  sharply  tuned 
multiple  unit  responses  (Schreiner  and  Mendelson  1990) 

AIv  represents  location  ventral  to  the  most  sharply  tuned  region  of  Al 
MON.  monotonicity;  SRL.  strongest  response  level:  DR.  Dynamic  range;  Step,  response 
strength  to  a  rapid  frequency  step  from  0.2  to  64  kHz.  i.e..  broadband  transient  (Schreiner 
aiid  Mendelson  1990) 
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Fig.  9A-C.  The  strongest  response  level  (SRL)  of  rule  level  func¬ 
tions  (see  Materials  and  methods).  All  encountered  SRL  values  arc 
plotted  as  a  function  of  characteristic  frequency  iCF)  for  each  of 
the  three  '  uses.  Mean  values  with  standard  deviations  arc  given  in 
Table  1 


Discussion 

Response  thresholds  and  rate  level  functions  for  CF 
tones  were  determined  in  the  primary  auditory  cortex  of 
anesthetized  cats  using  the  multiple-unit  recording 
technique  and  related  to  the  estimates  of  integrated  ex¬ 
citatory  bandwidth  obtained  at  the  same  locations.  Three 
descriptive  and  functionally  significant  parameters  were 
e.xtracted  from  the  rate,  level  functions,  namely  dynamic 
range.  SRL.  and  monotonicity.  The  spatial  distribution 
of  these  parameters  across  the  high-frequency  portion  of 
AI  was  then  determined.  In  addition  to  the  four  par¬ 
ameters  described  in  this  report,  eight  other  parameters 
were  studied  at  the  same  locations  in  the  same  animals 
in  order  to  determine  the  global  distribution  of  response 
properties  in  AI.  A  major  goal  of  this  study  was  to 
compare  and.  potentially,  relate  the  distributions  of  the 
obtained  response  properties  to  each  other.  Since  it  is 
difficult  to  obtain  maps  for  such  a  large  number  of 
stimulus/response  parameters,  only  the  three  most  com¬ 
plete  cases  were  included  in  this  and  a  previous  report 
that  described  the  spatial  distribution  of  four  spectral 
parameters  (CF.  Q-10  dB.  0-40  dB.  and  transient  re¬ 
sponse  strength;  Schreiner  and  Mcndelson  1990),  Inter¬ 
pretations  and  discussions  of  methodological  influences 
will  be  given  for  each  of  the  studied  parameters  separate¬ 
ly  followed  by  a  brief  comparison  with  the  spatial  distri¬ 
butions  of  other  parameters. 


Response  threshold 

The  obtained  range  of  contralateral  response  thresholds 
of  approximately  SO  dB  encountered  in  this  study  is  in 
close  agreement  with  threshold  ranges  seen  in  previous 
multiple-unit  (Schreiner  and  Cynadcr  1984)  and  single- 
unit  studies  (Phillips  and  Irvine  1981)  of  AI.  In  two  of 
the  three  cases,  the  lowest  thresholds  were  essentially 
independent  of  CF  in  the  tested  ranges  of  0,65  and  1,57 
octaves.  In  the  third  case  (87-001).  CFs  between  12  and 
14  kHz  had  thresholds  approximately  25  dB  above  the 


Table  3.  Correlaiion  of  monotoniciiy  of 
rate  level  functions  with  cortical  response 
measures  obtained  at  the  same  locations 


Case 

AI 

Aid 

AIv 

r 

P~ 

r 

P 

r 

P 

THR 

87-518 

0,32 

0.01 

0.56 

0.0(811 

SRL 

87-518 

0.46 

0.0001 

0.36 

0.03 

0.41 

0.04 

87-706 

0.58 

0.0001 

0.63 

0.0001 

0.51 

0.002 

87-001 

0.54 

0.0001 

0.51 

O.OOOl 

0.52 

0.003 

DR 

87-706 

-0.24 

0.03 

-0.47 

0.004 

CMOdB 

87-001 

-0.29 

0.004 

-0.28 

0.03 

0-40  dB 

87-518 

-0.50 

0.001 

-0.35 

0.03 

-0.56 

0.003 

87-706 

-0.29 

0,008 

-0.34 

005 

87-001 

•0.25 

0.02 

-0.48 

0.003 

Step 

87-518 

0.50 

0.0001 

0.53 

0.0006 

0.43 

003 

87-706 

0.49 

0004 

87-001 

0.39 

0.0002 

0.35 

0.01 

0.48 

0.008 

THR.  minimum  response  threshold.  For  further  explanations  see  Table  2 


115 


6o4  5 


20-1  £g 


M7-001 


FIK'  lOA-C.  Spuliul  disiribulion  of  the  strongest  response  levels 
(SRLsj  encountered  in  primary  auditory  cortex  (AI)  expressed  in 
dll  SPL,  A  The  upper  portion  depicts  u  pseudo  three-dimensional 
projection  of  the  spuiiul  distribution  ofSRLs  m  Al  (case  87-SI8). 
The  loner  portion  of  the  panel  shows  an  undistorted  contour-plot 
rcpreseniation  of  the  same  data.  The  contour  intervals  arc  10 dB. 
(.Sec  I’ig.  4  for  t'urthcr  details. i  P  ”  atial  distnbution  ofSRLs  for 
case  87-700.  C  Spatial  disf'’  "o  •  if  SRLs  for  case  87-001.  The 
lon  er  portion  of  the  p.'t  '  ibtained  SRL  values  at  tbcir 

approximate  cortica'  i  a'l- 
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Table  4.  Correlation  o  ‘ti'ongest  response 
let  el  vtiih  cortical  response  measures  ob¬ 
tained  at  the  Same  locations 


Case 

AI 

Aid 

Als 

r 

P 

r 

P 

r 

P 

THR 

87-516 

0.u2 

0.0001 

0.78 

n.otxii 

87-706 

0..15 

0.001 

0.33 

0.04 

87-001 

0.48 

0.0001 

0.55 

0.0001 

MON 

87-518 

0.46 

0.0001 

0.36 

0.03 

041 

0.04 

87-706 

0.58 

0.0001 

0.63 

0.0001 

Oil 

0  18)2 

87-001 

0.54 

0.0001 

0,51 

0.0001 

0.52 

l).0!)3 

Q-IO  dB 

87-706 

-0.28 

0.01 

-0.47 

0.003 

0-40  dB 

87-518 

-0.28 

0.02 

-0.42 

0.02 

87-706 

-0.35 

0.001 

Step 

87-518 

0.26 

0.02 

0.45 

n.oox 

87-706 

0.26 

0  02 

0.60 

O.OOOI 

87-001 

0.42 

0.0001 

0.39 

0.003 

0.47 

0,005 

For  explanations  sec  Tables  2.  J 


threshold 

<15dBSPL 


dynamic  range 
<20dB 


strongest  level 
<AQiB  SPL 


monotonicity 
<-OJi  VdB 


Fig.  1 1.  Comparison  of  the  spatial  distribution  of  response  parame¬ 
ters  in  pnmars  auditors  cortex  (All.  Contour  plots  of  the  value 
distributions  arc  shots n  for  threshold,  dynamic  range,  strongest 
response  level  (SRL).  and  monotonicity.  For  the  first  three  parame¬ 
ters.  the  shaded  areas  correspond  to  the  approximate  lower  half  of 
the  paramter  range  for  each  case.  For  the  monotonictty  distribu¬ 
tions.  the  shaded  areas  represent  values  of  moderate  to  high  non- 
monotonicity.  The  ihin  dashed  lines  indicate  the  orientation  of  the 


isofrcquency  axis.  The  ihiek  dashed  line  separates  dorsal  .Al  t.-l/i/l 
from  ventral  A1  (.-l/r).  as  suggested  by  a  reversal  in  the  overall 
g'^adient  of  inte;.  'ted  excitatory  bandwidth.  It  represents  the  mean 
location  of  the  sharpest  frequency  tuning  lO  and  4i)dB  above 
minimum  threshold  and  closely  approximates  the  locations  of  low¬ 
est  response  strength  to  a  broadband  transient  i  Schreiner  and  Mcn- 
delson  1990) 
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Table  5.  Correlation  of  the  dvnamie  range 
of  rate  level  funeuons  with  cortical  re¬ 
sponse  measures  obtained  at  the  same 
locations 


Case 

.\I 

Aid 

Ah 

f 

P 

r 

p 

r 

P 

THR 

87-001 

-OM 

0.006 

MON 

87-706 

-0.24 

0.03 

-0.47 

O.IKM 

Q-IO  dB 

87-001 

0.25 

0.02 

0.47 

0.01 

For  explanations  see  Tables  2.  3 


lowest  values  encountered  for  that  animal.  The  in  vitro 
calibration  of  the  sound  system  accounted  only  for  max¬ 
imal  deviations  of  less  than  half  of  that  value.  Since  the 
sound  delivery  system  was  not  tested  in  vivo,  direct 
contributions  of  the  transfer  function  to  this  discrepancy 
cannot  be  completely  excluded.  However,  another  pos¬ 
sible  explanation  for  this  reduced  range  of  sensitivities  is 
provided  by  considering  the  oblique  orientation  of  this 
mapped  cortical  area  relative  to  the  orientation  of  the 
isofrequcncy  axis  (Fig.  1 1 ;  see  also  Fig,  4  in  Schreiner 
and  Mendelson  1990).  As  a  consequence  of  this  oblique 
orientation,  certain  frequency  ranges,  including  that 
from  12  to  14  kHz.  were  not  mapped  along  the  entire 
lengths  of  the  isofrequency  domain.  Since  cortical  loca¬ 
tions  with  similar  thresholds  appear  to  be  spatially  clus¬ 
tered.  it  is  conceivable  that  low-threshold  areas  were 
missed  for  some  CF  ranges  located  in  the  comers  of  the 
mapped  area. 

No  clear  influence  of  the  multiple-unit  recording  on 
the  threshold  values  is  apparent  when  comparing  the 
encountered  values  with  those  from  single-unit  record¬ 
ings  (Phillips  and  Irvine  1981).  Response  thresholds  of 
cluster  recordings  always  correspond  to  the  most  sen¬ 
sitive  neurons  within  that  cluster  (or  the  most  sensitive 
thalamocortical  contribution).  The  threshold  range  of 
neurons  contributing  to  a  cluster  response  cannot  un¬ 
equivocally  be  estimated  from  the  multiple-unit  response 
alone.  However,  if  the  threshold  range  within  any  cluster 
was  fairly  large,  say  on  the  order  of  30-60  dB.  this  effect 
could  express  itself  in  relatively  large  dynamic  ranges  for 
the  multiple-unit  recordings,  and  a  shift  of  the  average 
threshold  in  single-unit  recordings  relative  to  the  multi¬ 
ple-unit  thresholds.  Neither  of  these  consequences  are 
apparent  in  the  sample  of  this  study,  suggesting  that  the 
scatter  of  response  thresholds  of  units  within  the  record¬ 
ed  clusters  was  generally  below  30-40  dB. 

The  spatial  distribution  of  response  thresholds  indi¬ 
cates  a  spatial  clustering  of  locations  with  similar  re¬ 
sponse  thresholds.  The  shapes  of  those  aggregates  ap¬ 
peared  to  tack  uniformity.  In  ail  three  cases,  at  least  one 
low-threshold  aggregate  was  encountered  near  or  at  the 
dorsovcniral  center  of  AI  as  judged  by  the  length  of  the 
isofrequency  domain,  and  the  locations  of  maxima  in  the 
sharpness  of  multiple-unit  tuning  cu.  es  as  reported  in 
the  first  part  of  this  study  (Schreiner  and  Mendelson 
1990). 

In  a  previous  multiple-unit  study  of  AI  and  All 
(Schreiner  and  Cynader  1984).  the  average  threshold  of 
locations  with  excitatory, exciiatorv  binaural  interaction 
were  5-10  dB  higher  than  those  lor  locations  with  cx- 
citaiory  inhibiiory  binaural  interaction.  This  average 


threshold  difference  between  different  binaural  interac¬ 
tion  types  is  small  compared  to  the  overall  range  of 
encountered  threshold  values.  Consequently,  it  should 
only  have  a  relatively  small  influence  on  the  global  spatial 
distribution  pattern  of  tnreshold  values  in  AI. 

The  obtained  range  and  distribution  of  response 
threshold  values  within  the  isofrequency  domain  of  AI 
suggests  that  response  thresholds  may  contribute  to  a 
systematic  organization  of  the  cortical  processing  of  sig¬ 
nal  intensity.  Systematic  distributions  of  response  thresh¬ 
olds  have  txen  earlier  described  in  the  primary  auditory 
cortex  of  the  dog  (Tunturi  1952)  and  the  mustached  bat 
(Suga  and  Manabc  1982). 

Dynamic  range 

The  mean  value  for  the  dynamic  range  of  rate  level  func¬ 
tions  obtained  in  these  multiple-unit  maps  of  AI  was 
19 1'  7  dB.  with  all  locations  having  a  dynamic  range  less 
than  or  equal  to  40  dB.  For  single  units  tested  with 
tone-onset  times  of  10  ms,  Phillips  and  Irvine  (1981) 
reported  81  %  to  have  dynamic  ranges  below  40  dB.  The 
average  dynamic  range  of  single  cortical  neurons  ob¬ 
tained  with  comparable  tone-onset  times  as  used  in  this 
study  was  in  the  range  of  10  ±  3  dB  (Phillips  1988)  or.  in 
another  study,  1 7  dB  for  monotonic  neurons  and  l2.3dB 
for  nonmonotonic  neurons  (Phillips  and  Hall  1986).  The 
slightly  higher  estimate  of  the  dynamic  range  for  multi¬ 
ple-unit  recordings  is  not  unexpected  in  light  of  the  argu¬ 
ments  of  the  threshold  composition  of  recorded  unit 
clusters  given  above.  The  increase  in  the  average  dynamic 
range  for  multiple  units  by  less  than  10  dB  suggests  that 
the  threshold  disparity  within  some  clusters  may  be  even 
less  than  the  30-  to  40-dB  estimate  given  above.  How¬ 
ever.  other  influences  on  the  response,  such  as  inhibitory 
interactions  within  a  cluster,  have  to  be  taken  into  con¬ 
sideration  to  completely  account  for  the  observed  level 
range  of  the  initial  rapid  growth  in  rate  level  functions. 
Of  the  four  parameters  under  consideration,  the  de¬ 
rived  dynamic  range  values  were  technically  the  most 
vulnerable  to  random,  nonsystemaiic  influences.  This  is 
largely  due  to  the  step  size  of  the  intensity  levels  used  in 
the  analysis  (5  dB)  which  resulted  in  a  small  number  of 
possible  dynamic  range  values  (6-8).  An  error  of  one  step 
size  in  the  estimate  represents  deviations  between  1 1  %  for 
large  dynamic  ranges  and  100%  for  small  dy ramie 
ranges.  The  spatial  distribution  of  dynamic  ranges  in¬ 
dicated  some  spatial  aggregates  of  similar  values.  The 
distribution  pattern  appeared  to  be  quite  idiosyncratic 
and  showed  no  clear  alignment  with  distribution  patterns 
of  the  other  intensity  parameters. 
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Momionicity 

A  major  physiological  distinction  among  auditors  coni¬ 
cal  neurons  is  the  monotonicity  or  nonmonotomcits  of 
their  rate  level  functions  in  response  to  tones  (Brugge  et 
al.  1969,  lirulkaret  al.  1956;  Evans  and  Whitfield  1964; 
Phillips  and  Hall  1986;  Phillips  and  Irvine  1981 ;  Phillips 
01  al.  1985).  Noninotioionic  units  have  a  distinct  ma.Ki- 
mum  in  the  tiring  rate  for  a  certain  stimulus  level.  Phillips 
and  Irvine  1 1981)  have  reported  that  25%  ofsingle  uims 
in  Al  had  nonmonotonic  rale  level  functions.  In  another 
single-unit  study  of  Al  (Phillips  and  Hall  1986).  38.5%  of 
the  rate  level  functions  were  classified  as  nonmonotonic. 

In  the  cuiTciii  study,  the  degree  of  monotonicity  or 
nonmonotomcity  was  assessed  in  a  somewhat  ditTcreiu 
way  than  in  the  studies  mentioned  above,  by  obtaining 
the  slope  of  the  rate  level  function  above  the  transition 
point,  i.c..  after  the  rapid  growth  phase  of  the  function. 
As  a  result,  one  derives  a  continuous  measure  of  the  rate 
of  change  of  that  portion  of  the  rate  level  function.  This 
measure  can  be  estimated  without  having  to  rely  on  the 
firing  rate  at  the  high  level  end  of  the  ratedevel  function, 
as  is  necessary  for  other  measures  (Phillips  cl  al.  1985; 
Imig  ct  al.  1990).  The  slope  measure  can  also  quantify 
slow  positive  growths  of  the  high  level  portion  that  are 
not  discerned  by  those  measures.  Applying  the  slope 
measure  of  monotonicity  to  a  few  examples  ol' published 
single  unit  rate  level  functions  (Phillips  ct  al.  1985:  Phil¬ 
lips  and  Hull  1986;  Phillips  1988)  resulted  in  slopes  rang¬ 
ing  from  +  1.56%idB  to  -8.7%/dO  (N-  16).  These  re¬ 
evaluated  single  unit  rate/lcvcl  functions  had  been  ob¬ 
tained  with  lone  rise  times  between  2.5  and  10  ms.  thus 
bracketing  the  rise  time  of  3  ms  used  in  this  study. 
Changes  in  rise  limes  have  been  shown  to  inliuence  the 
slope  of  the  high  level  portion  and.  therefore,  to  alter  the 
degree  of  noiimonotonicity  (Phillips  1988), 

In  the  current  multiple-unit  study.  60.5%  of  the  loca¬ 
tions  showed  nonmonotonic  rale, 'level  functions  as 
judged  by  a  negative  slope  of  the  high-level  portion,  a 
clear  deviation  from  the  pattern  seen  for  single  units. 
This  proportion  of  nonmonotonic  locations  cun  be  low¬ 
ered  to  approximately  20%  by  using  a  more  strict 
criterion  for  manifest  noiimonotonicity,  for  example 

-  1%  dll.  thereby  excluding  those  moderately  negative 
slope  estimates  that,  at  least  partially,  could  be  attributed 
to  some  random  scatter  in  the  course  of  otherwusp  sat¬ 
urated  rate  level  functions.  However,  no  unequivocal 
benchmarks  for  selecting  an  appropriate  objective 
criterion  for  noiimonotonicity  were  evident. 

.Another  feature  of  the  obtained  monotonicity  values 
that  IS  at  variance  with  single-unit  data  is  that  the  range 
of  the  vast  majority  of  multiple-unit  slopes  was 
compressed,  '  alucs  ranged  only  from  +  1,5%  dB  to 

-  2.5%  dB.  thus  reducing  the  negative  slope  range  to  less 
than  a  third  of  that  found  for  single  units. 

The  main  reason  for  these  discrepancies  between  sin¬ 
gle-  and  multiple-unit  results  is  likely  related  to  inhcreiu 
properties  of  the  multiple-unit  recording  technique, 
namely  the  integration  of  the  firing  rate  from  several 
units  near  a  given  cortical  location.  In  order  to  observe 
nonmonotoiiicities  in  the  multiple-unit  recording  that  are 


just  as  large  as  observed  for  single  units,  ail  units  in  the 
recorded  cluster  have  to  possess  the  same  degree  of  non- 
inonotoniciiy.  the  same  response  threshold,  and  the  same 
dynamic  range.  On  the  other  hand,  it  is  less  likely  that 
a  cluster  containing  only  nonmonotonic  units  that  ditVer 
m  those  other  properties  would  result  in  a  monoiomc 
rate  level  function.  The  consequence  is  that  clusters  with 
predominantly  nonmonotonic  units  will  also  appear  as 
nonmonotonic  in  the  multiple-unit  recording,  however, 
with  a  reduced  degree  of  nonmonotoniciiy.  Clusters  that 
contain  only  moiiotonic  units  will  be  recorded  us  mono- 
tonic.  possibly  with  a  small  to  moderately  large  positive 
slope.  The  slope  of  rate  level  functions  for  clusters  that 
contain  about  the  same  number  of  monoiomc  and  non¬ 
monotonic  units  will  likely  be  negative.  The  magnitude 
of  the  negative  slope  will  probably  be  small  and  will  be 
determined  by  the  amount  of  nonmonotonicity  as  well  as 
by  the  threshold  and  dynamic  range  distributions  of  the 
units.  In  other  words,  the  measure  of  monotonicity  as 
applied  in  this  study  reflects  the  degree  of  homogeneity 
in  the  rate,  level  functions  of  the  contributing  neurons  as 
well  as  the  global  nature  of  the  level-dependent  behavior 
of  the  units  at  the  high  and  low  ends  of  the  monotonicity 
scale. 

Phillips  et  al.  (1985;  Fig,  8)  demonstrated  an  example 
of  the  spatial  distribution  of  munotoiiicity  at  locations  in 
auditory  cortex.  Of  the  58  locations  reported  for  that 
case.  29.3%  showed  only  cells  with  monotonic  rate  level 
functions,  and  48.3%  exclusively  contained  non- 
monotonie  cells.  At  the  remaining  22.3%  locations, 
monotonic  as  well  as  nonmonotonic  cells  were  encoun¬ 
tered  ("mixed"  composition).  Combining  locutions  of 
nonmonotonic  cells  with  those  of  mixed  composition 
results  in  70.6%  locations  that  could  show  some  degree 
of  nonmonotonicity  if  sampled  with  a  multiple-unit  re¬ 
cording,  This  percentage  is  in  closer  agreement  with  the 
percentage  of  nonmonotonic  locations  (6U.5%)  found  in 
the  present  study  than  with  the  distribution  based  on 
single  units  that  arc  not  topographically  localized  (sec 
above). 

The  spatial  distribution  of  the  monotonicity  measure 
across  the  mapi^cd  auditory  cortices  consistently  showed 
aggregates  of  locutions  with  high  nonmonotonicity  and 
usually  smaller  areas  only  containing  locations  with  rate 
level  functions  of  positive  slope.  In  most  cases,  these 
ureas  were  separated  by  locations  with  slopes  neui  zero, 
suggesting  gradual  transitions  between  predominantly 
monotonic  and  predominantly  nonmonotonic  portions 
of  Al.  Indications  of  spatial  segregation  of  monoiomc 
and  nonmonotonic  units  in  Al  have  been  given  previous¬ 
ly  (Realc  ct  al.  1979;  Phillips  et  al.  1985).  Phillips  and 
colleagues  (1985)  also  mentioned  a  "horizontal"  segrega¬ 
tion  of  the  area  of  nonmonotonicity  from  locations  with 
monotonic  projxrtics  corresponding  to  the  rosirocaudal 
orientation  of  the  elongated  central  nonmonotonic  area 
observed  in  all  three  cases  in  this  study.  In  two  of  the 
three  cases  described  here,  a  second,  noncontiguous, 
smaller  region  of  nonmonotonicity  was  observed  that 
was  located  dorsal  to  the  main  area  of  nonmonotonicitv. 

For  a  functional  interpretation  of  the  segregation  of 
areas  with  high  monotonicity  and  high  noniv.onotomc- 
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ii>.  several  aspects  have  to  be  considered.  The  obtained 
range  and  spatial  patierr,  of  monotonicity  values  within 
the  isofrcquencv  domain  provides  further  evidence  that 
the  processing  of  signal  intensity  may  be  systemutically 
organized  and  distributed  in  the  primary  auditory  cortex. 
Regions  of  high-level  selectivity  (strongly  nonmonoton¬ 
ic)  and  regions  of  low-level  selectivity  (monotonic)  are 
nonrandomly  superimposed  over  regions  with  high-  or 
low-level  scnsiiiviiy.  thus  providing  a  number  of  inicn- 
sity-spccilic  parameter  combinations  that  are  potentially 
of  lunct'onal  significance, 

Another  aspect  of  the  monotonicity  issue  is  that  a  high 
degree  of  nonmonotonicity  also  retkcis  the  involvement 
of  a  high  degree  of  inhibitory  activity  (Suga  and  Manabc 
1982:  Phillips  and  Cynader  1985;  Phillips  ct  ul.  1985). 
This  covuriunee  has  consequences  for  the  appropriate 
spectral  and  temporal  content  of  signals  that  arc  optimal¬ 
ly  matched  for  those  locations  (Phillips  1988).  i.e..  it 
provides  construiius  for  the  possible  "information-bear¬ 
ing  puramoiers"  represented  at  those  locations. 

Sirtmyesi  responxv  level 

The  stimulus  level  eliciting  the  highest  firing  rate  ofa  unit 
or  cluster  response  is  referred  to  us  “most  ctrcctivc  sound 
pressure  level"  (Urugge  and  Mcrzcnich  1969)  or  "best 
amplitude"  (Suga  1977)  and  implies  some  tuning  to  that 
level.  The  measure  referred  to  here  as  "strongest  response 
level"  combines  all  of  the  aspects  of  ratedevcl  functions 
discussed  above  -  response  threshold,  dynamic  range, 
and  iiioiiotonicity  -  and  assigns  a  single  variable.  Al¬ 
though  the  leduciion  of  several  properties  to  u  single 
value  represents  a  simplification  of  these  physiological 
issues,  the  procedure  is  useful  since  it  provides  a  unified 
picture  that  is  easily  referred  to.  is  expressed  in  units  (dB 
SPL)  that  directly  relate  to  well-known  physical  proper¬ 
ties  of  the  signal,  and  is  probably  closely  related  to 
intensity  coding  (Pfingstctal.  1977;  Suga  l977;Sugaand 
Manabc  1982). 

The  SRL  expresses  a  condition  ot  maximal  excitatory 
activity  of  u  single  unit  or  a  group  of  neuron';.  This 
condition  may  or  may  not  be  of  a  specific  functional 
significance  for  a  group  of  neurons  whose  individual 
properties  may  vary  (see  above).  However,  as  long  as  the 
specific  properties  of  local  cortical  circuitry  and  the  con¬ 
tributions  from  difTerenl  typ>cs  ot'neu’'ons  arc  not  known, 
this  measure  should  sufFicc  in  assessing  the  global  disfi- 
bution  of  local  states  of  activity. 

A  disadvantage  that  has  to  be  hcpl  in  mind  in  using 
the  measure  of  SRL  is  that  it  docs  not  express  the  degree 
of  selectivity  for  that  specific  level  condition.  For  strong¬ 
ly  nonmonotonic  units,  a  SRL  is  highly  significant  (Suga 
and  Manabe  1982).  For  monotonic  units,  a  SRL  may 
also  be  derived,  c.g..  as  the  highest  level  in  units  with 
slightly  increasing  ratc/lcvcl  functions  or  as  the  average 
level  of  the  saturated  portion  in  a  ratc/lcvcl  (unction. 
However,  the  level  selectivity  m  the  two  latter  eases  is  low 
and  the  measure  becomes  functionally  less  signiticani 
unless  it  is  viewed  in  combination  with  other  measures 
that  more  spccilically  express  the  local  prop/erlics. 


Considering  that  the  highest  levels  used  in  this  studs 
did  not  usually  exceed  85  dB  SPL.  the  range  of  SRLs 
observed  in  this  study  (10-80  dB  SPL)  is  in  reasonable 
agreement  with  the  range  found  for  best  levels  for  single 
units  in  the  auditory  cortex  of  the  cat  (0-65  dB  SPL.  Imig 
et  tl.  1990;  or  12-90  dB  SPL.  as  mtt.-acicd  from  data 
published  by  Phillips  1988;  Phillips  and  Cynader  1985; 
Phillips  and  Hall  1986.  1987;  and  Phillips  et  al.  1985. 
1989).  the  macaque  (15-95  dB  SPL.  Brugge  and  Mer- 
zenich  1973).  and  the  mustached  bat  (15-98  dB  SPL. 
Suga  1977;  Suga  and  Manabe  1982).  The  relativels  even 
distribution  of  the  SRL  values  across  that  range  m  com¬ 
bination  with  the  existence  ofa  significant  percentage  of 
monotonic  rate,  level  functions  provides  evidence  for  rel¬ 
atively  equal  signal  processing  capacities  at  all  intensities 
in  the  normal  hearing  range. 

Similar  to  the  other  intensity-related  measures.  SRLs 
were  not  randomly  distributed  across  AI  but  showed 
aggregates  of  high,  medium,  or  low  SRLs.  One  major 
rostrocaudally  elongated  aggregate  of  lower  SRL..s  war- 
located  approximately  in  the  dorsovcntral  center  of  Al 
oriented  orthogonally  to  the  isofrequency  domain.  The 
dorsal  half  of  AI  was  usually  dominated  by  higher  SRLs. 
although  smaller  clusters  of  lower  SRLs  were  also  ob¬ 
served.  In  the  ventral  portion  of  two  maps.  SRLs  around 
60  dB  SPL  dominated. 

The  consistent  correlation  between  SRL  and  thresh¬ 
old  (see  also  Suga  and  Manabc  1982)  as  well  as  monoto¬ 
nicity  is  rcllccted  in  the  spatial  distribution  of  the  SRL. 
Sui>criniposing  the  shaded  areas  in  the  threshold  and 
monotonicity  contour  maps  of  cases  87-518  and  87-706 
provides  a  good  prediction  of  the  distribution  of  low 
SRLs.  However,  the  prediction  falters  in  areas  that  arc 
not  highly  nonmonotonic  and  arc  therefore  not  par¬ 
ticularly  level  selective. 

The  observation  that  essentially  the  full  range  of  pos¬ 
sible  SRLs  was  encountered  in  a  somewhat  systematic 
fashion  along  a  given  isofrcquciicy  contour  suggests  ihi: 
cxistfiiicc  of  an  ampliotopic  representation  in  cut  Al 
similar  to  that  seen  in  the  rnustached  bat  (Suga  and 
Manabc  1982). 


Comparison  of  spatial  characteristics  of  A I 

Frequency  organization.  The  most  prominent  spatial  pat¬ 
tern  in  ilic  physiological  organization  of  AI  is  the  coch- 
leotopic  gradient  tliat  unequivocally  defines  its  rostral 
and  caudal  extent  (c.g  .  Mcrzcriich  et  al,  1975,  Reale  and 
Imig  1980).  The  dorsovcntral  extent  of  the  isofrcquencv 
domain  is  marked  by  a  more  or  less  gradual  decline  of 
the  prccisioii  in  tht  frequency  organization  toward  the 
presumed  boundaries  of  A I  (Mcrzcnich  ct  al.  1975; 
Midd'ebrooks  and  Zook  1984;  Reals  and  Imig  1980; 
Schrci.Ticr  and  Cynader  1984;  Schreiner  and  Mcndelson 
1990).  Just  as  in  the  cyioarchitccturc  (Rose  1949:  Winer 
1984)  or  thalamocortical  projection  pattern  (Andersen  et 
al.  1980;  Drandner  and  Redis  1990;  Imig  and  Morel 
1984;  Middlcbrooks  and  Zook  1984;  Morel  and  Irr.ig 
1987)  the  CF  organization  docs  not  provide  sharp  and 
unequivocal  boundary  criteria  foi  ihc  dorsovcntral  ex- 
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tent  of  AI.  The  distribution  of  physiological  response 
parameters  other  than  frequency  could  possibly  provide 
the  desired  information.  However,  examination  of  the 
maps  in  this  report  does  not  indicate  clear  parametric 
criteria  that  would  justify  their  sole  use  in  determining 
the  dorsoventral  boundaries  of  .AI.  Most  changes  in  the 
spatial  distribution  of  parameters  are  relatively  gradual, 
and  in  the  dorsal  or  ventral  portion  of  AI.  none  of  the 
four  parameters  showed  a  clear  functional  demarcation 
that  continuously  extended  across  the  rostrocaudal  di- 
iT>ension  of  the  middle  portion  of  the  ectosylvian  sulcus. 

Spi’circil  hancinidih.  In  a  previous  report  (Schreiner  and 
Merdelson  1990).  the  spatial  distributions  of  the  integra¬ 
tive  excitatory  bandwidth  and  the  response  to  transient 
signals  had  been  documented  for  the  same  cortices  ex¬ 
plored  in  this  study.  The  spatial  distribution  of  the  inte¬ 
grated  excitatory  bandwidth,  expressed  as  Q-10  dB  and 
Q-40  dB  of  multiple-unit  tuning  curves,  revealed  an  area 
of  sharp  tuning  in  the  dorsoventral  center  of  AI  or¬ 
thogonally  oriented  to  the  isofrcqucncy  domain 
(Schreiner  and  Mendelson  1990)  with  a  more  or  less 
gradual  decrease  of  the  integrated  bandwidth  toward  the 
dorsal  and  ventral  boundaries  of  AI.  On  the  basis  of 
these  multiple-unit  gradients,  as  well  as  the  responses  of 
single  neurons  (Sutter  and  Schreiner  1991a).  AI  was 
divided  into  a  dorsal  (Aid)  and  a  ventral  portion  (AIv). 
Linear  regression  revealed  that  response  threshold, 
monotonicity,  and  SRL  showed  a  more  robust  correla¬ 
tion  with  the  excitatory  bandwidth  values  in  AIv  than  in 
Aid.  This  trend  is  less  prominent  but  still  discernible  for 
the  correlations  with  the  response  to  a  broadband  tran¬ 
sient  and  supports  the  notion  of  a  physiological  distinc¬ 
tion  between  these  two  regions  of  AI. 

Large  responses  to  a  transient  signal  were  predomi- 
nanlly  found  outside  the  areas  of  sharp  tuning  (Schreiner 
and  Mendelson  1990).  A  similar  reciprocal  relationship 
between  nonmonotonicity  and  large  transient  responses 
IS  suggested  by  their  essentially  nonoverlapping  spatial 
distributions  (compare  Fig.  1 1  with  Fig.  16  in  Schreiner 
and  Mendelson  1990).  Since  units  with  nonmonotonic 
responses  usually  have  a  narrow  frequency  tuning  (sec 
Table  3)  and  reflect  strong  inhibitory  influences,  the 
observed  relationship  between  nonmonotonicity  and 
bioudbund  stimulus  response  strength  was  to  be  expected 
(Phillips  et  al.  1985).  The  spatial  coincidence  between  the 
main  area  of  sharpest  frequency  tuning  and  local  minima 
in  the  distribution  of  SRL  and  monotonicity  indicate 
that  tliese  parameters  are  not  independently  organized 
in  .AL 

Biiuuiral  iniercu  tion.  Three  of  the  four  described  parame¬ 
ter  distributions  indicate  an  alignment  of  at  least  one  of 
their  main  spatial  aggregates  with  the  CF  gradient  and. 
consequently,  reflect  orthogonality  with  the  isofrcqucncy 
donum.  The  main  orientational  axis  of  binaural  iiucrac- 
'jon  "bands"  has  also  been  shown  to  be  orthogonal  to 
ihe  isofrcqueni  V  domain  of  Al  of  the  cat  (Imig  and 
Adrian  1977;  Imig  and  Brugge  1978;  Middlebrookset  al. 
1980;  Schreiner  and  Cynadcr  1984).  Although  the  bi¬ 
naural  organization  of  the  maps  under  study  will  be 
described  elsewhere,  it  is  useful  to  say  that  only  the 


response  threshold  showed  some  degree  of  correlation 
with  binaural'ity  as  determined  by  simple  classification  of 
near-threshold  binaural  summation  or  suppression  m 
agreement  with  observations  bv  Schreiner  and  Cvnader 
( 1984). 

Recently.  Imig  and  colleagues  ( 1990)  studied  a  func¬ 
tionally  and  behaviorally  more  relevant  measure  of  bin¬ 
aural  interaction,  that  of  azimuthal  direction  selectivity, 
and  reported  that  the  selectivity  of  neurons  in  cat  .A I  to 
the  azimuthal  direction  of  a  broadband  sound  source  was 
related  to  the  level  selectivity  of  the  neurons.  Neurons 
with  strong  nonmonotonic  rate  level  functions  in  re¬ 
sponse  to  broadband  noise  were  always  highly  direction¬ 
al  selective  and  a  wide  range  of  SRLs  was  associated  with 
a  particular  azimuth.  In  the  same  report,  the  authors 
stated  that  locations  with  similar  azimuth  and  level 
tuning  showed  evidence  of  spatial  clustering  and  that 
several  azimuthal  directions  were  represented  within  a 
narrow  frequency  range  across  AI. 

Deviations  between  the  spatial  distribution  of  low 
SRL  and  nonmonotonicity,  as  reported  in  this  study, 
support  the  notion  that  sharply  tuned  SRLs.  can  be 
found  across  the  whole  intensity  range.  However,  highly 
nonmonotonic  locations,  i.e..  sharply  tuned  to  u  stimulus 
level,  seem  to  be  more  prominent  in  areas  with  low  SRLs. 
at  least  for  pure  tone  nonmonotonicity.  This  may  differ 
from  broadband  rate/ level  functions  as  used  in  the  study 
by  Imig  et  al.  (1990). 

Some  implications  for  functional  organization  ol  A! 

The  three  illustrated  maps  were  chosen  because  the  distri¬ 
bution  of  several  other  response  parameters  arc  known 
for  the  same  cases  and  ot  because  these  were  the  most 
convincing  examples  of  a  systematic  representation  of 
sound  intensity.  The  suggestion  of  a  nonrandom  distri¬ 
bution  of  response  properties  that  are  involved  in  the 
coding  of  sound  intensity,  i.e..  threshold.  SRL.  and 
monotonicity,  has  been  seen  in  essentially  every  auditory 
cortical  map  derived  in  our  laboratory  (c.g..  Merzenich 
cl  al.  1991;  Raggio  et  al.  1992;  Sutter  and  Schreiner 
1991b).  However,  the  apparent  variance  from  one  animal 
to  another  in  the  spatial  distribution  of  intensity-related 
paranieicrs  indeed  appears  to  be  larger  than  that  seen  for 
CF  (c.g..  Merzenich  et  al.  1975;  Reale  and  Imig  1980)  or 
bandwidth  organization  (Schreiner  and  Mendelson 
1990;  Sutter  and  Schreiner  1991a.  b). 

An  important  contribution  to  the  idiosyncratic  or 
"noisy”  appearance  of  a  potentially  ampliotopic  signal 
representation  -  aside  from  the  influences  introduced  by 
the  multiple-unit  recording  method  -  may  have  been  the 
fact  that  the  values  of  the  utilized  intensity  parameicis 
can  be  modified  substantially,  e.g,.  by  certain  back¬ 
ground  noise  conditions  (for  review  sec  Phillips  1990)  or 
by  certain  attention-  or  learning-induced  hisiochemical 
response  modulations  (e.g..  Methcrate  et  al.  1990).  Since 
the  current  results  were  obtained  without  background 
noise  and  in  the  anesthethized  animal,  it  is  likely  that  the 
cortical  representation  of  intensity  parameters  under  be¬ 
haviorally  more  relevant  conditions  may  dilTcr  somewhat 
from  .those  described  here. 
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Following  u  scheme  used  in  establishing  hierarchical 
organization  in  visual  cortex  from  ascending  and  de¬ 
scending  projections.  Rouiller  and  colleagues  (1991)  de¬ 
lineated  an  overall  hierarchy  for  auditory  cortical  fields 
from  their  interconnections.  .M  appears  to  be  at  the 
initial  level  of  processing  and  representation  in  auditory 
cortex  and.  therefore,  is  likely  to  provide  a  more  general 
signal  representation  based  on  a  number  of  coding  as¬ 
pects.  Consequently,  it  is  possible  that  the  need  to  repre¬ 
sent  other  signal  aspects  as  well.  e.g..  binaural  or  band¬ 
width.  may  interfere  with  a  smooth  representation  of 
sound  intensity  in  AI.  .Auditory  fields  located  higher  in 
the  hierachy  of  cortical  processing  may  prove  to  show 
more  specialized  and.  consequently,  more  systematic  and 
spatially  smooth  representations  of  certain  signal  as¬ 
pects.  There  is  some  evidence  that  the  posterior  auditory 
field  may  contain  a  more  precise  map  of  sound  intensity 
(Phillips  and  Orman  1984;  personal  observations)  than 
seen  in  Al. 

Previous  physiological  and  projectiot.al  studies 
provided  evidence  for  a  somewhat  “patchy’*  organization 
of  the  Al  of  the  cat.  In  the  isofrcquency  domain,  binaural 
interaction  sound  localization  properties  appear  to  show 
some  banding  or  clustering  of  similar  response  charac¬ 
teristics  (Imig  and  Adrian  1977;  Imig  and  Brugge  1978; 
Imig  et  al.  1990;  Middlebrooks  et  al.  1980;  Rajan  ct  ai. 
1990).  Intrinsic  connections  in  AI  also  appeared  to  be 
patchy  rather  than  continuous  (Matsubara  and  Phillips 
1988.0Jima  et  al.  1991;  Wallace  and  Bajwa  1991;  Wal¬ 
lace  et  al.  1991a.  b).  The  finding  of  a  rather  patchy  and 
overall  less  gradual  spatial  distribution  of  intensity  pa¬ 
rameters  may  relate  to  those  anatomical  findings  and 
provides  further  evidence  that  the  isofrequency  domain 
in  AI  is  functionally  subdivided  and  does  not  represent 
a  uniform  representational  axis. 
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Abstract.  The  sputiul  distribution  of  ncuronul  responses 
to  frcquency-moduluted  (FM)  sweeps  was  mapped  with 
niiuroelcctrodes  in  the  primary  auditory  cortex  (At)  of 
barbiturate-anesthetized  cats.  Increasing  and  decreasing 
FM  sweeps  (upward-  and  downward-directed  FM 
sweeps,  res|7cctively)  covering  a  range  of  0.25-64.0  kHz 
were  presented  at  three  different  rates  of  frequency 
change  over  time  (i.e.,  sweep  speed).  Using  multiuuit 
recordings,  the  high-frequency  domain  (between  3.2  and 
26.3  kHz)  of  AI  was  mapped  over  most  of  its  dorsoven- 
trul  extent  (us  determined  by  the  distribution  of  the  exci¬ 
tatory  bandwidth,  Q,umt)  tor  all  six  cases  studied.  The 
spatial  distributions  of  the  preferred  sweep  speed  and  the 
preferred  sweep  direction  were  determined  for  each  case. 
Ncutonul  responses  fur  frequency  sweeps  of  different 
speeds  appeared  to  bo  systematically  distributed  along 
thi:  clorsovcntrul  axis  of  AI.  In  the  dorsal  region,  cortical 
cells  fypicully  responded  best  to  fast  und/or  medium  F'M 
sweeps,  followed  more  vcntrally  by  cells  that  responded 
best  to  medium  -  then  slow  then  medium-speed  FM 
sweeps.  In  tlic  mote  ventral  aspect  of  AI  (which  in  some 
oases  may  also  have  included  cells  located  in  the  dorsal 
region  of  the  second  auditory  Held,  All),  neurons  general¬ 
ly  preferred  fust  FM  sweeps.  However,  a  comparison  of 
maps  from  dilVernut  animals  showed  that  there  was  more 
variability  in  the  distribution  of  preferred  speed  respons¬ 
es  in  the  ventral  region  of  the  cortex.  The  directional 
preference  of  units  for  FM  sweeps  was  detennined  for  the 
sweep  speed  producing  the  strongest  response.  Direction 
selectivity  appeared  to  be  nunrundomly  distributed 
along  the  dorsoventral  axis  of  AI.  In  general,  units  that 
responded  best  to  upward-directed  FM  sweeps  were  lo- 
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cated  in  the  mote  dorsal  and  ventral  aspects  of  AI  while 
units  that  responded  best  to  downward-directed  FM 
sweeps  were  usually  located  in  the  mid-region  of  AI.  Di¬ 
rection  selectivity  was  also  determined  for  multiunit  re¬ 
sponses  at  each  of  the  three  FM  sweep  speeds.  In  general, 
there  was  a  relatively  close  agreement  between  the  spatial 
distributions  of  direction  selectivity  determined  for  the 
strongest  response  with  those  calculated  for  the  fast  and 
medium  spe^s.  The  spatial  distribution  of  direction  se¬ 
lectivity  determined  for  slow  FM  sweeps  deviated  some¬ 
what  from  that  determined  for  the  strongest  response. 
Near  the  dorsoventral  center  of  the  mapped  areas,  the 
distribution  of  units  that  responded  best  to  downward 
sweeps  tended  to  overlay  the  distribution  of  units  that 
responded  best  to  slow  speeds,  suggesting  some  spatial 
covariance  of  the  two  parameters.  However,  when  the 
analysis  was  extended  over  the  entire  region  of  cortex 
examined  in  thi.s  study,  the  point-by-point  correlation 
between  preferred  speed  and  direction  .selectivity  was  not 
statistically  signiricunt.  In  addition,  when  neural  respons¬ 
es  obtained  from  the  dorsal  and  ventral  subregions  were 
analyzed  separately,  no  significant  correlation  was  ob¬ 
served  between  these  two  response  parameters.  This  sug¬ 
gests  that,  for  a  given  cortical  locution,  the  response 
projxrtics  of  direction  selcctivit '  and  preferred  spci^  are 
derived  from  distinct  neural  processing  mechanisms.  Sig¬ 
nificant  observations  were  also  made  between  preferred 
FM  sweep  speed  and  excitatory  bandwidth  (i.e.,  Qiudu 
and  Qoojb)  such  that  units  that  responded  best  to  slower 
FM  sfKcds  also  seemed  to  have  higher  Qk^u  <tnd  Qoodu 
(i.c.,  were  narrowly  tuned)  and  vice  versa.  In  addition, 
units  that  responded  well  to  «  broadband  transient  stim¬ 
ulus  in  gczicrul  preferred  faster  FM  sweeps  and  vice  versa. 
Although  these  correlations  were  significant  across  the 
entire  dorsoventral  extent  of  AI  investigated  in  this 
study,  they  were  stronger  for  responses  in  the  dorsal  sub¬ 
region  of  AI.  For  direction  selectivity,  statistically  signifi¬ 
cant  correlations  with  these  response  parumcteis  were 
observed  more  of  ten  in  the  dorsal  than  the  ventral  re¬ 
gions  of  AI.  The  a^vparent  spatial  segregation  of  neuronal 
responses  to  dilTcn  mt  FM  sweep  speeds  and  sweep  direc- 
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tions  disiributed  along  the  isofrequeucy  domain  of  AI 
suggests  that  the  global  aspects  of  cortical  function  are 
compatible  with  psychophysically  derived  notions  of  par¬ 
allel  streams  of  processing  for  different  aspects  of  FM 
signals. 

Key  words:  Frcqueroy-modulatcd  sweep  -  Direction  and 
speed  selectivity  -  Primary  auditory  cortex  -  Topograph¬ 
ical  organi7ation  -  Cat 


Introduction 

Naturally  occurring  auditory  signals  arc  characterized  by 
their  modulation  of  amplitude  and  frequency  composi¬ 
tion  over  time.  These  types  of  modulation  are  evident  in 
various  aspects  of  communication  signals  us  well  as  in 
sound  sources  that  move  relative  to  an  observer.  Rele¬ 
vant  stimuli  fur  studying  neuronal  responses  to  these  dy¬ 
namic  acoustical  properties  include  umplitude-mudulut- 
cd  (AM)  and  frequency-modulated  (FM)  signals.  It  has 
been  shown  in  difTercnt  stages  of  the  auditory  pathway 
that  neurons  are  responsive  to  AM  and  FM  signals.  Fur 
AM  stimuli,  the  range  of  modulation  frequencies  that 
produce  excitatory  responses  in  neurons  decreases  along 
the  ascending  auditory  pathway,  indicating  the  process¬ 
ing  of  a  mure  limited  range  of  temporal  events  in  higher 
auditory  structures  (Rees  and  Mollcr  1983;  Schreiner  and 
Urbas  1986, 1988;  Schreiner  and  Lungner  1988).  Ascend¬ 
ing  in  the  auditory  pathway  from  the  periphery  to  the 
cortex,  selectivity  of  neurons  for  certain  features  of  FM 
stimuli,  such  as  tiie  direction  (i.c,  upward-directed; 
changing  from  a  low  to  a  high  frequency,  or  vice  versa)  or 
siwed  3f  frequency  sweeps  (i.c.,  rate  of  change  of  frequen¬ 
cy  over  time),  appears  to  increase  (Sugu  i965u;  Whitfield 
and  Lvuns  1965;  Walunabc  1972;  Mollcr  1974;  Nomoto 
1980;  Shore  and  Nuttall  1985;  Poon  ct  al.  1991;  Mcndcl- 
son  and  Grasse  1992).  At  the  level  of  the  primary  audito¬ 
ry  cortex  (AI)  of  cats,  the  majority  of  neurons  arc  selec¬ 
tive  for  a  particular  direclior.  and/or  sjwcd  of  FM  sweep 
(Mcndclson  and  Cynader  1985;  Mciidclson  and  Grasse 
1992).  For  example,  Mcndclson  and  colleagues  have 
shown  in  their  single  unit  investigations  of  Al  that,  of 
those  cells  that  arc  direction  selective,  the  majority  prefer 
downward-directed  as  opposed  to  upward-directed  FM 
sweeps  (Mcndclson  and  Cynader  1985;  Mcndclson  and 
Grasse  1992).  They  also  observed  that  the  majority  of 
cortical  cells  preferred  relatively  fast  speeds  (0.4-0.8  IcH:, 
ms)  of  FM  sweeps  (Mcndclson  and  Cynader  1985; 
Mcndclson  and  Grasse  1992).  In  addition,  they  found 
that  the  preferred  FM  speed  and  the  degree  of  direction 
selectivity  varied  considerably  from  neuron  to  neuron. 

A  pertinent  question  regarding  FM  and  AM  stimulus 
responses  is  their  functional  organization  within  the  au¬ 
ditory  cortex.  Topographical  functional  orgamzatioti  ap¬ 
pears  to  be  a  general  feature  of  ncocortical  specialization 
(Mountcastlc  1957;  Hubei  and  Wicscl  1965).  For  exam- 
pie,  in  the  auditory  cortex  of  the  bat,  the  interpretation  of 


functional  specialization  has  been  greatly  enhanced  by 
understanding  the  underlying  topographical  organiza¬ 
tion  (Suga  1965a;  O’Neill  and  Suga  1982;  Tsuzuki  and 
Suga  1988;  Edamatsu  ct  al.  1989).  Suga  and  his  col¬ 
leagues  (Suga  1965a,  1986;  O’Neill  and  Suga  1982; 
Tsuzuki  and  Suga  1988;  Edamatsu  ct  al.  1989)  have 
shown  that  neuronal  responses  to  particular  aspects  of 
FM  stimuli,  a  crucial  component  of  the  bat’s  biosonur 
system,  are  topugraplrically  organized  in  the  auditory 
cortex.  In  fact,  they  have  identified  an  entire  cortical  area 
devoted  exclusively  to  the  processing  of  FM  signals. 

Until  recently,  only  two  response  features  were  known 
to  be  systematically  distributed  in  the  auditory  cortex  of 
other  mammals  such  as  the  cut.  These  features  arc  the 
characteristic  frequency  (CF),  which  is  distributed  along 
the  rostrocaudai  axis  (Woolsey  and  Walzl  1942; 
Mcrzcnich  ct  al.  1975;  Rcalc  and  Imig  1980),  and  alter¬ 
nating  bands  or  patches  of  binaural  faciiitatory  (EE)  and 
binaural  inhibitory  (El)  cells,  disttibuted  along  the 
dursovcntral  extent  of  AI  (Imig  and  Adrian  1977; 
Middlebrooks  ot  al.  1980).  More  recently,  Rcalc  and  Ket- 
tner  (1986)  extended  these  tesults  by  showing  that  cells 
located  in  the  ventral  region  of  cat  AI  contain  both  facii¬ 
itatory  and  inhibitory  response  characteristics,  depend¬ 
ing  upon  the  stimulus  intensity  presented.  Within  the  last 
few  years,  evidence  has  begun  to  accumulate  which 
shows  that  other  functional  response  properties  are  sys¬ 
tematically  distributed  in  AI.  For  example,  Schreiner  and 
Mcndclson  (1990)  have  shown  that  “integrated  excitato¬ 
ry  bandwidth”  (i.c.,  Q,oju  and  O^oju)  and  lesponscs  to 
broadband  transients  are  topographically  arranged  with¬ 
in  AI.  The  principal  gradient  of  the  integruted  excitatory 
bandwidth  was  found  to  be  oriented  orthogonally  to  the 
cochleotopic  gradient.  Other  response  parameters  such 
as  inhibitory  sidebands  (Sutter  and  Schreiner  1991),  stim¬ 
ulus  intensity  responses  and  monotonicity  (Schreiner  ct 
al.  1992)  have  also  been  found  to  be  systematically  dis¬ 
tributed  in  the  auditory  cortex.  In  addition,  certain  prop¬ 
erties  of  sound  localization  cues  have  been  shown  to  be 
systematically  distributed  in  the  auditory  cortex  (Imig  ct 
al.  1990;  Rajan  ct  al.  1990). 

While  we  now  have  a  greater  understanding  of  the 
spatial  distribution  of  many  discrete  cortical  response 
properties,  we  do  not  know  if  or  how  responses  to  dy¬ 
namic  signal  features  arc  distributed  within  the  auditory 
cortex.  Given  that  Mcndclson  and  colleagues  (Mendel- 
son  and  Cynader  1985:  Mcndclson  and  Grasse  1992) 
have  shown  that  the  majority  of  auditory  cortical  single 
units  arc  selective  for  the  direction  and/or  speed  of  FM 
sweep,  we  addressed  the  question  of  whether  neurons 
with  similar  FM  response  properties  arc  randomly  dis¬ 
tributed  across  the  cortex,  or  whether  FM  responses 
show  a  systematic  spatial  organization  across  the  prima¬ 
ry  auditory  cortical  field.  Specifically,  we  explored  neu¬ 
ronal  responses  t(^  the  direction  and  speed  of  FM  sweeps 
over  a  wide  cortical  area  to  sec  if  the  regional  distribution 
of  these  responses  reflects  on  organized  representation  of 
these  stimulus  parameters  in  AI. 
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Materials  and  methods 

Preparation 

A  more  detailed  deKriptioit  of  the  preparation  and  recording  meth¬ 
ods  has  been  presented  in  a  previous  paper  in  this  series  (Schreiner 
and  Mendclson  1990).  Briedy,  experiments  were  conducted  on  six 
adult  cats  shown  otoscopically  to  have  clean  ears.  Animals  were 
initially  tranquilued  with  ketamine  hydrochloride  (10  mg/kg)  and 
acetylpromazine  maleate  (0.1  mg/kg)  intramuscularly  to  allow  for 
venous  cannulaiion.  Animals  were  then  given  an  i.v.  injection  of 
pentobarbital  sodium  (Nembutal,  30  mg/kg)  followed  by  intramus¬ 
cular  injections  of  atropine  (1  mg)  and  dexomethasone  sodium 
phosphate  (0.14  mg/kg).  Animals  were  maintained  at  a  surgical  level 
of  anesthesia  throughout  the  duration  of  the  experiment  with  an  i.v. 
infusion  of  Nembutal  (2  mg/kg  per  h)  in  lactated  Ringer  solution 
(infusion  volume  3.5  ml/h).  Body  temperature  was  maintained  at 
3?.5“C 

Both  pinnae  were  rellected  and  the  auditor^  meatuses  transect¬ 
ed  to  allow  for  placement  of  the  earphones  within  3  mm  of  the 
tympanic  membrane.  The  temporal  muscle  on  the  right  side  was 
retracted  and  u  craniotomy  made  to  expose  the  right  middle  ccto- 
sylviun  gyrus.  All  wound  margins  and  pressure  points  were  inHltrat- 
ed  with  a  long-acting  local  anesthetic  (bupivucainc  hydrochloride 
2.5%).  The  dura  mater  was  resected  and  the  underlying  brain  was 
covered  with  silicone  oil.  A  photograph  of  the  suifuce  vasculature 
wu.s  used  to  record  the  locations  of  the  electrode  irenctrations. 


2>(imuitition  and  recording 

The  imimul  was  placed  in  a  head  holder  that  permitted  unobstruct¬ 
ed  access  to  the  auditory  meutu.ses.  7'he  animal  was  located  in  an 
electrically  shielded,  sound-attenuating  chamber  (lAC).  Insulated 
tungsten  electrodes  were  introduced  into  the  cortex  orthogonal  to 
the  surface  (as  viewed  through  a  Zeiss  operating  microscope)  by 
means  of  a  remote-controlled  stepping  motor  microdrivc.  I'or  most 
cases,  single-unit  and  unit  cluster  responses  were  examined  at  one 
location  per  electrode  penetration  at  a  depth  between  600  and  1000 
pm  below  the  cortical  surface,  corresponding  roughly  to  layers  III 
and  IV.  However,  for  a  small  sample  of  penetrations,  single  units 
were  recorded  at  a  number  of  different  depths  (ranging  from  500  to 
1800  pm).  Tlie  level  discriminator  for  spike  selection  was  set  at 
50-100%  above  the  noise  level  of  the  lecording.  The  number  of 
units  |)er  recording  location  was  estimated  to  be  Ixttwcen  2  and  8. 
Stimulu.s-cvokcd  action  potential  event  times  wore  collected  and 
sloied  on-line  by  a  PI3P  11/73  computer. 

Cuts  were  initially  stimulated  with  tone  bursts  (50  r.is  duration 
with  u  3-ms  rise/full  time)  presented  in  a  pseudorandom  sequence  of 
frequency-intensity  cembinutions  selected  from  15  intensity  levels 
(5-dB  steps)  and  45  frequency  values  (geometrically  centred  around 
the  CP)  as  well  us  with  broadband  transient  signals  (sec  Schreiner 
and  Mendclson  1990).  Following  this,  cells  were  tested  with  expo¬ 
nential  FM  sweeps  extending  from  0.25  to  64.0  kHz  (upward-direct¬ 
ed)  or  from  64.0  to  0.25  kHz  (downward-directed)  generated  by  a 
computer-triggered  function  generator  (Wavetek  185).  Exponential 
FM  sweeps  were  used  in  this  study  for  a  number  of  reasons.  First, 
the  broad  FM  sv/cep  range  ensures  that  the  signal  starts  and  ends 
outside  tile  excitatory  and  inhibitory  response  ureas  of  the  neurons 
under  study,  thus  providing  u  unifonn  stimulation  paradigm.  Sec¬ 
ond.  exponential  FM  sweeps  bear  a  close  resemblance  to  the  fre¬ 
quency  organization  of  the  basilar  membrane  resulting  in  similar 
I'M  sweep  parameters  across  the  range  of  frequencies  examined 
without  the  need  to  adjust  the  speed  according  to  the  characteristic 
frequency  (CF)  of  each  neuron.  Rates  of  frequency  change  (i.c., 
sweep  speeds)  between  1  and  110  octaves  per  second  were  used. 
However,  for  the  majority  of  the  multiple-unit  recordings,  only 
tlirce  sweep  rate:  were  used;  1 10  octaves  per  second.  33  octaves  per 
second,  and  10  octaves  [)cr  second.  The  corresponding  sweep  dura¬ 
tions  for  these  FM  sweep  s|>ecds  were  72,  240.  and  800  ms,  respec¬ 


tively.  The  presentation  rate  of  cither  an  upward-  or  downward-di¬ 
rected  FM  sweep  was  1  sweep  per  second  and  was  independent  of 
the  sweep  speed.  The  sweep  speed  was  held  constant  within  each 
FM  sweep  presentation,  but  was  varied  from  one  set  of  trials  to  the 
next.  Each  FM  sweep  was  preceded  by  a  2(X>-ms  interval  containing 
the  upper  or  lower  starting  frequency  of  the  sweep  (both  of  these 
frequencies  were  beyond  the  average  liearing  range  of  the  cat  by  at 
least  1. 3-2.5  octaves  from  the  CF)  in  order  to  avoid  the  risk  of 
contaminating  the  data  with  any  equipment  related  broadband 
transient  click  at  the  beginning  of  each  FM  sweep  presentation.  For 
the  FM  sweeps  whose  speeds  were  110  and  33  octaves  per  second 
the  FM  sweeps  were  followed  by  a  628-  or  S60-ms  interval  of  the 
upper  or  lower  final  frequency  of  the  FM  sweep,  respectively.  The 
onset  of  each  FM  sweep  stimulus  was  triggered  by  the  computer. 

Both  the  pure  tone  and  FM  sweep  stimuli  were  attenuated 
(Hewlett-Packard)  and  passed  through  transducers  (STAX  .54)  at¬ 
tached  to  sound  delivery  tubes  sealed  into  the  auditory  meatuses. 
The  system  was  calibrated  for  pure  tones  with  a  sound-level  meter 
(13iuel  and  Kjuer  2209).  Tlie  transfer  function  of  the  sound  delivery 
system  showed  a  roll-off  of  iU  dB/octave  above  14  kHA  and  addi¬ 
tional  deviations  of  up  to  ±6  dll  were  possible  (sec  Fig.  1  in 
Schreiner  anti  Mendclson  1990). 


Recording  procedure 

Initially,  tlie  CF,  response  threshold,  and  frequency  response  area  of 
u  neuron  or  group  of  neurons  were  determine.  Among  other 
parameters,  the  excitatory  bandwidth  at  10  dB  and  40  dB  above 
minimum  threshold  (expressed  as  Q|(>,,||  and  Q4UdH>  respectively)  and 
response  to  u  broadband  transient  stimulus  were  also  determined 
(sec  Schreiner  and  Mendelson  1990).  I'ollowiiig  these  characteriza¬ 
tions,  FM  sweeps  were  typically  presented  munaurally  to  the  con¬ 
tralateral  cat.  For  the  small  sample  of  cells  encounter^  which,  by 
conventional  standards,  would  be  classified  as  piedominuntiy 
binaural  cells,  FM  sweeps  were  presented  binuurally. 

FM  sweeps  weie  presented  at  a  stimulus  level  approximately  40 
dB  above  tlie  minimum  threshold.  This  stimulus  level  was  selected 
for  a  number  of  reasons.  First,  we  wanted  to  include  the  majority  of 
neurons  at  a  given  recording  site.  If  FM  sweeps  were  presented  at 
10  or  20  dB  above  threshold,  then  units  with  high  thresholds  may 
have  been  inadvertently  excluded  from  the  sample.  Second,  by  pre¬ 
senting  FM  sweeps  ut  40  dB  above  threshold,  more  units  would  be 
stimulated  maximally.  Finally,  while  it  has  been  shown  in  single- 
unit  studies  that  nonmonotonic  units  can  exhibit  a  rate  decrease 
with  an  increase  in  stimulus  intensity  (Phillips  and  Irvine  1981),  wc 
fell  that  the  possibility  of  recording  weaker  responses  in  some  non¬ 
monotonic  celi.<!  was  preferable  to  tiie  possibility  of  not  recording 
any  responses  from  cells  with  higlicr  thresholds. 

Each  of  the  six  I'M  sweep  conditions  (two  directions  at  three 
speeds)  was  presented  4U  times,  and  poststimuius  time  histograms 
(PSTHs)  were  constructed,  lire  number  of  spikes  within  a  lO-ms 
window  around  the  maximum  response  for  each  condition  was  used 
as  a  measure  of  response  strength  for  each  stimulus  condition. 

With  the  exception  of  one  cxpcriraeni,  u  conical  sunacc  area  of 
approximately  3x5  mm  was  examiued  with  80-150  electrode  pene¬ 
trations  per  map.  In  one  case,  the  cortical  area  sampled  was  3  x  2.2 
mm.  The  recording  strategy  was  to  map  the  dorsoventrul  domain  of 
AI  corresponding  to  the  orientation  of  the  isofrcqucncy  axis.  In 
several  cases,  the  most  dorsal  aspects  of  the  second  auditory  field 
(All)  were  included.  The  range  of  CFs  investigated  across  uli  ani¬ 
mals  extended  from  3.2  to  26.3  kHA 


Data  analysis 

Frequency  response  areas  derived  from  the  675  different  frequency- 
intensity  combinations  were  determined  for  every  unit  cluster.  From 
this  wc  were  able  to  derive  the  integrated  excitatory  bandwidth  at 
10  and  40  dB  (Qioja  and  Q^odsi  respectively).  Following  Shis,  two 
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methodji  were  used  to  analyTC  the  speed  sensitivity  of  neurons  at  a 
given  cortical  location.  The  first  type  of  analysis  took  into  account 
the  strength  of  responses  for  all  three  FM  speeds  tested  and  provid¬ 
ed  a  non-quantal  measure  of  preferied  speed,  independent  of  the 
total  firing  rate  at  a  given  cortical  location.  For  this  type  of  analysis, 
the  following  equation  was  used; 

(BRs  +  (2  X  BR„)  (  (3  x  BRp))/(BRs  +  BRm  +  BRp) 

where  BR  is  the  best  response  (i.e.,  the  highest  firing  rate)  of  the  two 
directions  at  each  of  the  three  FM  sweep  rates  (S,  slow;  M,  medium: 
F,  fast).  This  resulted  in  a  number  between  1  and  3  where  1  repre¬ 
sented  an  exclusive  response  to  the  slow  speed  and  3  an  exclusive 
response  to  the  fast  speed.  Numben  close  to  2  either  represented  a 
response  to  the  medium  speed  or  indicated  a  response  with  equal 
response  to  all  three  speeds,  i.e.,  a  response  without  speed  selectivi¬ 
ty.  This  potential  ambiguity  affected  an  average  of  lOVo  of  all  coni¬ 
cal  locations  examined.  However,  the  second  type  of  analysis  for 
FM  speed  .selectivity  described  below,  helped  to  further  distinguish 
whether  a  particular  esponse  was  for  medium  FM  sweep  speeds  or 
was  non-speed  selective 

The  second  type  of  analysis  provided  a  discrete  measure  of  the 
preferred  .speed  as  compared  to  the  continuous,  nondiscrete  mca' 


sure  described  above.  This  second  method  involved  selecting  the 
speed  condition  that  elicited  at  least  23%  more  spikes  than  either  of 
the  other  two  FM  speeds  and  assigning  it  the  number  1  if  that 
condition  was  the  slow  FM  sweep,  the  number  2  if  it  was  a  medium 
sweep,  and  the  number  3  if  it  was  a  fast  FM  sweep.  For  those  cases 
where  the  resnonse  difference  between  slow  and  medium  or  medium 
and  fast  was  less  than  23%,  but  each  exceeded  that  of  the  remaining 
sweep  speed  by  more  than  25%,  the  numbers  1.5  and  2.5  were 
assigned,  respectively.  If  none  of  these  criteria  were  met,  the  location 
was  classified  as  nonselective  for  FM  speeds. 

The  direction  selectivity  (DS)  of  a  neuron  or  neuron  cluster  was 
determined  by  comparing  its  response  to  upward-  and  downward- 
directed  FM  sweeps  at  its  preferred  speed.  The  degree  of  direction 
selectivity  was  quantified  using  the  following  equation: 


where  RU  the  firing  rate  elicited  by  upward-directed  FM  sweeps 
and  RD  to  dowmward-dirocted  FM  sweeps.  Values  ranged  from  + 1 
(responding  exclusively  to  upward-directed  FM  sweeps)  to  —  1  (re¬ 
sponding  exclusively  to  downward-directed  FM  sweeps).  This  mea¬ 
sure  of  directiou  selectivity  was  insensitive  to  the  units*  overall  firing 
.ate  and  thus,  was  not  co^ounded  by  rate  changes  due  to  the  speed 


DS  =  RU-RD/(RU-I-RD) 


O.  □  160.0  320.0  4B0.  0  640.0  BOO.  D 
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Fig.  1.  An  example  of  a  multiunit  response  to  frequency-modulated 
(FM)  sweeps.  The  panels  on  the  left  illustrate  the  responses  to  up¬ 
ward-directed  FM  sweeps,  while  those  on  the  right  are  to  down¬ 
ward-directed  FM  sweeps.  Tlic  responses  to  the  fastest  FM  sweeps 
are  shown  in  the  top  two  panels,  the  responses  to  the  medium  FM 


0.0  IBP.  0  320,0  480.0  B40.  0  BOO.  0 

TIm  (naac) 

sweeps  are  in  the  middle,  and  the  responses  to  the  slowest  arc  shown 
in  the  bottom  two  panels.  The  bar  under  each  abscissa  depicts  the 
duration  of  the  FM  sweep  for  each  of  the  three  speeds.  Tliis  unit 
cluster  responded  most  vigorously  to  upward-directed  FM  sweeps 
at  a  speed  of  33.3  octaves  per  second 
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of  FM  sweep.  For  example,  one  might  pre  ! .. "  that  the  firing  rate  of 
a  unit  duster  to  a  slow  FM  sweep  could  be  greater  than  the  firing 
rate  of  a  unit  cluster  to  a  fast  FM  sweep  because  the  multi-unit 
would  be  stimulated  in  its  exdtaiory  response  area  for  a  longer 
period  of  time  with  slow  FM  sweeps  than  if  it  were  stimulatr'd  with 
a  faster  FM  sweep.  Unit  responses  were  considered  to  be  direction 
selective  if  their  index  value  was  equal  to  or  greater  than  ±0.15, 
indicating  approximately  35%  more  spikes  for  one  direction  than 
the  opposite  direction. 

The  spatial  distribution  of  the  response  properties  are  presented 
as  pseudo  three-dimensional  projections.  These  types  of  plots  were 
generated  from  the  original  data  by  a  software  package  (Golden 
Software).  The  interpolation  algorithm  used  (inverse  distance 
squared)  inhereutiy  applied  some  minimal  spatial  smoothing  to  the 
data,  resulting  mainly  in  a  slight  compression  of  the  encountered 
functional  range  and  a  local  averaging  of  closely  spaced  adjacent 
points  (see  Schreiner  and  Mendelson  1990).  This  method  empha¬ 
sizes  topographical  organizations  with  low  spatial  frequency  and 
de-emphasizes  steep  local  gradients  or  variabilities.  Examples  are 
given  that  also  show  the  raw  data,  thus  allowing  a  direct  evaluation 
of  the  appropriateness  of  the  pseudo  three-dimensional  representa¬ 
tion  of  the  data. 


Results 


The  present  results  are  based  on  506  single-  and  multi- 
unit  responses  encountered  in  closely  spaced  electrode 
penetrations  over  a  wide  region  of  AI  in  six  animals.  The 
distance  between  each  penetration  varied  from  ISO  to  350 
pm.  With  the  exception  of  one  case  (87  420),  the 
dorsoventral  extent  of  cortex  mapped  in  these  experi¬ 
ments  was  approximately  5  mm.  Within  each  map,  a 
large  portion  ( >  75%)  of  the  tonotopic  map  indicative  of 
Al  was  observed  (Merzcnich  ct  al.  1975;  Realc  and  Imig 
1980),  All  recording  sites  examined  contained  responses 
to  pure  tones  and  to  at  least  one  of  the  FM  sweep  direc¬ 
tion  and/or  speed  conditions.  The  results  based  on  the 
integrated  excitatory  bandwidth  and  responses  to  broad¬ 
band  transient  stimuli  are  described  in  detail  by  Schrein¬ 
er  and  Mendelson  (1990)  and  thus  will  only  be  summa¬ 
rized  here  at  the  end  of  the  Results. 

Figure  1  illustrates  a  typical  set  of  responses  from  a 
multiunit  cluster  at  a  single  cortical  site  to  the  six  FM 
sweep  conditions  examined  in  this  study.  The  three  pan¬ 
els  on  the  left  show  multiunit  responses  to  upward-direct¬ 
ed  FM  sweeps,  while  the  three  panels  on  the  right  show 
responses  to  downward-directed  FM  sweeps.  Figure  1 
also  illustrates  how  cortical  responses  are  affected  by  dif¬ 
ferent  FM  sweep  speeds;  the  top  two  panels  show  re¬ 
sponses  to  fast  sweeps  (110  octaves  per  second),  the  mid¬ 
dle  panels  show  responses  to  medium  sweeps  (33  octaves 
per  second),  and  the  bottom  panels  show  responses  to 
slow  sweeps  ( 10  octaves  per  second).  As  can  be  seen,  cells 
at  this  cortical  site  responded  best  to  upward-directed 
FM  sweeps  presented  at  the  medium  speed.  Generally, 
only  one  dominant  response  maximum  in  the  PSTH  was 
observed  during  each  FM  sweep  presentation.  Occasion¬ 
ally,  a  second  smaller  response  was  present.  However,  the 
relative  timing  of  secondary  maxima  (relative  to  the  dom¬ 
inant  response  maximum)  was  independent  of  the  sweep 
rate  used.  This  suggests  a  rebound  effect  rather  than  a 
response  to  a  second  excitatory  frequency  region  or  an 
amplitude  modulation  effect. 


B 


Fig.  2.  Dependence  of  preferred  FM  speed  selectivity  (A)  and  dire<'- 
tion  selectivity  (B)  on  cortical  recording  depth.  The  response  para¬ 
meters  for  the  same  five  penetrations  are  plotted  as  a  function  of 
recording  depth  (radial  penetrations).  No  statistically  significant 
correlation  between  recording  depth  and  preferred  FM  sweep  speed 
or  direction  selectivity  was  observed.  Pen  I,  penetration  1;  Pen  2, 
penetration  2;  etc 


While  most  multiunit  cortical  responses  described 
here  were  recorded  at  a  single  depth  per  electrode  pene¬ 
tration,  for  one  case  we  examined  single  unit  responses  at 
several  depths  within  a  single  penetration.  Figure  2 
shows  examples  of  preferred  FM  sweep  speed  responses 
(Fig.  2A)  and  direction-selective  responses  (Fig.  2B)  as  a 
function  of  cortical  depth  for  five  different  electrode  pen¬ 
etrations.  As  illustrated,  neither  the  preferred  speed  nor 
the  preferred  direction  appeared  to  change  signiilcantly 
as  a  function  of  cortical  depth,  especially  for  responses 
encountered  more  than  650  pm  from  the  cortical  surface. 

As  described  in  Materials  and  methods,  responses  to 
FM  sweeps  were  obtained  using  a  stimulus  level  of  40  dB 
above  response  threshold.  For  a  small  sample  of  multi¬ 
units  (n  =  10),  however,  we  examined  the  responses  to  FM 
sweeps  presented  over  a  range  of  stimulus  levels  to  ascer¬ 
tain  the  possible  effects  of  this  variable  on  preferred  FM 
speed  (Fig.  3A)  and  preferred  direction  (Fig.  3B).  The 
linear  regression  analysis  for  stimulus  level  and  preferred 
FM  speed  was  statistically  significant.  However,  the  slojx: 
of  the  line  indicates  only  a  small  shift  toward  preferences 
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Fig.  3A,B.  Dependence  of  preferred  FM 
epoed  and  direction  selectivity  on  stimulus 
level  of  FM  sweep.  A  The  pr^erred  PM 
sweep  speed  is  plotted  as  a  function  of  stim¬ 
ulus  level  re  minimum  threshold  (0  dB).  Re< 
sponies  from  five  tecording  locations  from 
the  middle  conical  laminae  are  shown.  The 
dashed  line  is  the  result  of  a  linear  regression 
analysis  [y—0.07  x-0.33;  r—0.76;  F>«0.0001 
(.'^■test)].  B  The  linear  regression  analysis  for 
direction  selectivity  as  a  function  of  stimulus 
level  is  indicated  by  the  dashed  line  [y* 

-0.02  X  -(-0,C«;  r-0.41 ;  P-0.03  (F-test)]. 

Pen  1,  penetration  1;  Pen  2  penetration  2; 
etc 
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for  faster  FM  sweeps  at  higher  stimulus  levels.  For  the 
effects  of  stimulus  level  on  direction  selective  responses, 
the  linear  regression  analysis  was  also  statistically  signifi¬ 
cant  (Fig.  3B).  However,  as  with  preferred  FM  speed  and 
stimulus  level,  the  shallow  slope  of  th:  line  suggests  that 
there  is  a  minimal  change  from  upward-  to  downward- 
directed  FM  sweeps  as  stimulus  level  is  increased. 


Representation  of  preferred  speed 

Tabic  1  provides  the  distribution  of  preferred  FM  sweep 
rates  for  each  individual  case,  as  well  as  for  all  cases 
combined.  In  general,  the  largest  percentage  of  recording 
sites  (39.8%)  was  marked  by  responses  prefeniiig  fast 


FM  sweep  rates.  In  all  but  one  case  (86-697),  preferences 
for  fast  FM  sweeps  were  encountered  at  least  twice  as 
often  as  prefereuces  for  medium  or  slow  FM  sweeps.  The 
most  striking  anisotropy  for  fast  versus  medium  and  slow 
sweep  speeds  was  observed  in  case  87-129  where  61.7% 
of  the  responses  encountered  preferred  fast  FM  sweep 
rates.  Excluding  those  sites  in  which  responses  were 
strongly  driven  by  more  than  one  rate  of  FM  sweep,  the 
second  largest  percentage  of  observed  responses  (16.7%) 
preferred  medium-speed  FM  sweeps,  followed  by  yet  a 
smaller  percentage  (9.3%)  that  preferred  slow  FM 
sweeps.  At  approximately  10%  of  the  cortical  locations 
examined,  recorded  responses  exhibited  no  preference  for 
the  three  FM  speeds  studied. 


Ill  order  to  demcristratc  th&t  the  FM  swe 
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Table  I.  Distribution  of  prdetred  frequency-modulated  sweep  rates 


Cat 

Number 
of  sweeps 

Slow 

(%) 

Slow/ 

medium  (%) 

Mediurii 

(%) 

Medium/ 
fast  (%) 

Fast 

(%) 

No  prefereuce 
(%) 

86-697 

88 

5.7 

9.1 

27.3 

20.3 

27.3 

10.2 

87-420 

63 

14.3 

1.6 

15.9 

20.6 

41.3 

6.3 

87-518 

81 

12.4 

4.9 

13.6 

16.0 

42,0 

11.1 

87-706 

83 

3.6 

8.4 

20.5 

19.3 

41.0 

7.2 

87-129 

81 

6.2 

2.5 

6.2 

11.1 

61.7 

12.3 

87-001 

96 

14.6 

8.3 

15.6 

22.9 

29.2 

9.4 

Total 

492 

9.3 

).  . 

16.7 

18.5 

39.8 

9.6 

I 
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Fig.  4.  Rtttc  tuning  profile  of  u  single-unit  to  u  range  of  FM  sweep 
speeds.  This  neuron  is  representative  of  u  small  sample  of  single- 
units  presented  with  a  range  of  sweep  speeds,  both  slower  and  faster 
than  the  three  used  in  the  study.  The  response  is  expressed  as  the 
percentage  of  the  maximum  that  occurred  for  downward-  directed 
FM  sweeps  modulating  at  a  rate  of  33  octaves  per  second.  The 
arrows  indicate  the  three  sweep  speeds  used  in  the  study.  In  general 
the  responses  to  the  additional  FM  .sweep  speed  conditions  provid¬ 
ed  a  gradual  transition  between  the  three  speeds  used  to  generate 
the  spatial  distribution 
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used  in  the  present  study  (u)  were  within  the  range  of 
speeds  that  cortical  neurons  can  respond  to  and  (b)  sam¬ 
pled  the  appropriate  range  of  speeds  with  sufficient  densi¬ 
ty,  a  wider  range  of  FM  speeds  was  presented  to  a  sample 
of  single  cortical  units  (n-<  10).  This  expanded  range  of 
FM  speed  conditions  included  slower  and  faster  rates 
than  those  used  in  the  mapping  studies.  These  FM  rates 
were  spaced  widely  enough  to  cover  a  broad  ra>  'i  of 
speeds  yet  narrowly  enough  to  detect  prominent  peaks  i.i 
the  sjjccd  profile.  Figure  4  shows  a  representative  exam¬ 
ple  of  the  rate  tuning  profile  of  a  single  unit  to  this  broad¬ 
er  range  of  FM  sweep  speeds.  The  response  is  expressed 
as  a  percentage  of  the  maximum  response.  The  arrows 
indicate  those  three  speed  conditions  that  were  used  to 
determine  the  spatial  distribution  of  FM  sweep  responses 
in  the  present  study.  A.S  illustrated,  this  neuron  respond¬ 
ed  best  to  downward-directed  FM  sweeps  changing  at  a 
rate  of  33  octaves  per  second;  i.c.,  the  medium  FM  sweep 
speed  condition  used  throughout  the  mapping  study.  In 
general,  FM  speed  profiles  were  unimodal,  fairly  broadly 
tuned,  and  showed  smooth  transitions  in  the  response 
strength. 

Figure  5  shows  the  distrihution  of  preferred  speeds  of 
FM  swccD  as  a  function  of  CF  for  two  cases  in  which  a 
narrow  and  a  broad  range  of  CFs  were  sampled,  respec¬ 
tively.  Id  both  ca.scs,  preferred  speed  did  not  seem  to 
depend  upon  CF.  Further,  there  was  a  uniform  distribu¬ 
tion  of  values  over  the  range  between  1.5  and  2.5.  al- 
th.ough  some  clustering  of  values  around  2.0  was  dis¬ 
cernible. 


10  so  40 

CF(tlHz) 

Fig.  SA.I).  Scatter  diagrams  of  preferttd  FM  sweep  rate  os  a  (unc¬ 
tion  of  charucteristic  frequency  (CF).  All  of  the  preferred  speed 
responses  are  plotted  as  a  function  of  CF  for  two  cases.  In  general, 
preferTcd  speed  appeared  to  be  independent  of  CF.  Tlic  number  I 
represents  a  preference  for  slow  spe^,  the  number  2  for  medium 
speeds,  and  3  for  fast  speeds 


In  most  cases,  there  appeared  to  be  an  organued,  non¬ 
random  spatial  distribution  of  preferred  speed  of  r'M 
sweep  responses  in  Al.  Figure  6  illustrates  an  example  of 
a  spatially  reconstructed  distribution  of  preferred  FM 
speeds  across  a  high-frequency  region  of  AI.  The  cortical 
sites  recorded  throughout  this  region  were  neural  re¬ 
sponses  with  CFs  between  9.6  and  12.4  kHz.  In  the  pseu¬ 
do  three-dimensional  projections,  the  elevation  of  the 
surface  of  the  plot  (along  the  Z-axis)  represents  the  pre¬ 
ferred  speed  sampled  at  a  given  recording  location.  Thus, 
low  elevations  represent  cortical  locations  containing 
preferences  for  slow  FM  sweeps  and  higher  elevations 
indicate  cortical  regions  in  which  preferences  for  fast  FM 
sweeps  were  observed.  The  dotted  line  indicates  a  prefer¬ 
ence  for  medium  FM  speeds  (denoted  by  the  number  2) 
and  has  been  included  as  an  aid  to  the  reader.  Arrow¬ 
heads  along  the  X-axis  (the  anteroposterior  extent  of  AI) 
indicate  the  approximate  orientation  of  the  isofrequency 
contours.  The  Y-axis  represents  the  dorsovcntral  extent 
of  the  map,  wlticb  in  this  case  measured  2.2  mm.  The 
numbers  in  the  box  below  the  pseudo  three-dimensional 
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Fig.  6.  Tlie  spatial  disirbution  of  preferred  speed  as  determined  by 
tlie  non-quantal  measurement.  7be  upper  portion  depicts  a  pseudo 
threc-dimcnsiunal  projection  of  the  spatial  distribution  of  preferred 
speed.  Two  dimensions  of  the  projection  represent  the  anteroposte¬ 
rior  (a.p)  and  dorsoveniral  (d,v)  extent  of  the  cortical  surface.  The 
arrowheads  along  the  anteroposterior  dimension  indicate  the  ap¬ 
proximate  onentation  of  the  isnfrcqucncy  contours.  The  elevation 
of  the  surface  of  the  plot  conesponds  to  the  preferred  speed  across 
the  cortical  surface.  A  smoothing  factor  of  0.9  was  applied  to  the 
data,  resulting  in  a  small  amount  of  additional  smoothing  (see 
Schreiner  and  Mcndcison  1990).  The  numbers  along  the  elcvational 
axis  refer  to  ihe  maximum  and  minimum  preferred  speed  values  for 
this  case  Tlic  low  number,  and  consequently  the  low  elevations, 
represents  a  preference  for  slow  FM  sweeps  and  the  high  number 
and  coincident  peak  indicate  a  preference  for  fast  FM  sweeps.  Tlie 
(lotted  line  indicates  a  preference  for  medium  FM  speeds  (indicated 
by  the  number  2)  and  has  been  included  to  aid  in  interpreting  the 
maps.  As  illustrated,  cells  that  preferred  fast  speeds  were  located 
dorsully  and  were  followed  by  a  region  where  cells  picfencd  medi¬ 
um  speeds.  In  the  ventral  portion,  cell.s  responded  best  to  slow  FM 
sweeps.  In  the  lower  portion,  the  numbers  in  the  box  represent  the 
actual  obtained  preferred  speed  response  of  FM  sweep  for  a  given 
location,  as  determined  by  the  non-quantal  measurement 

projection  represent  the  prcfcnct!  speed  of  FM  sweep  for 
a  given  location  as  detemiined  by  the  iiondiscrctc  mea¬ 
surement  technique  rtcsvrilxid  in  Materials  and  methods. 
In  the  dorsal  aspect  of  this  map,  recording  locations  were 
marked  by  responses  showing  a  preference  for  fast  FM 
sweeps,  while  in  the  more  ventral  region,  slower  F.M 
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Fig.  7.  A  comparison  of  the  spatial  distribution  of  preferred  FM 
specu  determined  with  the  non-quantal  (A)  and  the  discrete  mea- 
suicment  (B).  Both  figures  are  pseudo  three-dimensional  plots  of 
preferred  speed  calculated  in  these  two  ways  (see  Materials  and 
methods).  Little  differeucc  between  the  two  metliods  was  apparent 
in  the  spatial  distribution  of  preferred  speed.  Conventions  are  iden¬ 
tical  to  those  in  Fig.  6 

sweeps  were  preferred.  Responses  to  mediutn-speed  FM 
sweeps  were  encountered  in  an  infermediate  region  be¬ 
tween  fast  and  slow  speed  tesponse  subregions.  This  spa¬ 
tial  distribution  for  preferred  FM  speed  vas  similar 
across  the  range  of  frequencies  examined. 

Figure  7 A  shows  the  distribution  of  prcfeiTcd  speed 
for  case  87-518  calculated  by  the  noadkcrctc  classifica¬ 
tion  method,  and  Fig.  7B  illustrates  the  distribution  de¬ 
termined  by  the  discrete  measurement  technique  of  this 
response  parameter  (see  Materials  and  methods).  The 
spatial  distributions  derived  from  both  of  these  methods 
were  similar  to  each  other  across  all  cases  examined.  The 
spatial  distribution  of  preferred  FM  speed  for  this  case- 
dirfered  somewhat  from  other  cases.  Sites  located  in  the 
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more  dorsal  and  ventral  regions  v/ere  distinguished  by 
preferences  for  medium  as  opposed  to  fast  FM  sweep 
speeds,  while  for  the  central  region  interposed  between 
these  extreme  dorsal  and  ventral  subregions,  the  more 
typical  dorsoventral  spatial  pattern  of  fast-raedium-slow- 
mcaiuni-fast  was  observed.  However,  in  concurrence 
with  the  other  cases,  the  preferred  FM  speed  responses 
for  the  map  iilustiated  in  Fig.  7  appeared  to  be  largely 
independent  of  CF  (7.0  kHz  <  CF  <  1 1.0  kHz). 

As  there  was  little  difference  between  the  spatial  distri¬ 
bution  of  preferred  speed  responses  as  determined  by  the 
nondiscrete  and  discrete  measurement  techniques,  only 
the  former  measurements  will  be  illustrated  in  the  re¬ 
maining  cases.  To  demonstrate  the  consistency  of  the 
findings,  as  well  as  evidence  for  intcrindividual  varia¬ 
tions,  Fig.  8  shows  four  additional  cases  of  the  distribu¬ 
tion  of  preferred  FM  speed  responses  in  AI..A11  of  the 
maps  contained  a  circumscribed,  elongated  depression 
corresponding  to  slow  FM  sweep  speed  preferences 
flanked  by  a  series  of  gradual  transitions  to  regions  con¬ 
taining  preferences  for  medium  or  fast  speeds.  Generally, 
the  slow-spced  preferences  were  located  between  the  low¬ 
er  and  middle  third  of  the  mapped  dorsoventral  dimen¬ 
sion  and  were  oriented  onhogonally  to  the  isofrequency 
contours.  Figure  8A  (86-b97)  provides  an  example  of  the 
spatial  disiribution  of  preferred  FM  speed  for  sites  with 
CFs  between  7.8  and  15.2  kHz.  In  the  most  dorsal  aspect 
of  this  map,  preferences  for  fast  and/or  medium  FM  rales 
were  observed.  Adjacent  to  this  region,  indicated  by  a 
peak  in  the  map,  cortical  locations  were  marked  by  pref¬ 
erences  for  fast  FM  sweeps.  A  second  maximum  {i.c,,  sub¬ 


Fig,  8A-D.  Additional  examples  of  (he 
spatial  distribution  of  preferred  speed  as 
determined  by  the  non-  quanul  measure. 
Pseudo  three^imensional  represenUtion  of 
preferred  speed  for  case  86-697  (A),  case 
87-706  (Bk  case  87-129  (Q,  and  for  case 
87-001  (D).  Note  that  the  orientation  of 
the  mapp^  area  for  case  87-001  (D)  was 
tilted  24“  relative  to  the  isofrequency  ori¬ 
entation,  resulting  in  an  approximate 
alignment  of  the  IS.S-kHz  contour  with 
the  ventroposterior  to  donoanterior  axis 
of  the  map.  In  all  four  cases  a  systematic 
distribution  of  preferred  FM  sp^  was  ob¬ 
served.  Conventions  are  identical  to  those 
in  Fig.  6 


region  of  fast  FM  sweep  selective  units)  was  located  in  the 
more  ventral  aspect  of  the  mop.  In  the  middle  of  the  map 
(represented  by  the  through),  responses  exhibited  a  pref¬ 
erence  for  slow  speeds.  Interpolated  between  the  fast  and 
slow  subregions  were  recording  sites  consisting  of  re¬ 
sponses  to  medium  FM  sweeps.  In  the  dorsal  and  central 
regions,  preferences  for  fast  or  medium  FM  sweeps  ap¬ 
peared  to  be  independent  of  CF.  In  the  ventral  region, 
however,  the  selectivity  for  fast  FM  sweeps  was  expressed 
more  strongly  for  lower  CFs  (c.g.,  around  7-8  kHz)  while 
locations  with  higher  CFs  {c.g.,  15.0  kHz)  appeared  to 
prefer  medium  sweep  rates. 

Figure  8B  (87-706)  shows  another  example  in  which 
recording  from  several  sites  (5.8  kHz  <  CF  <  17.3  kHz) 
in  the  dorsal  and  middle  regions  of  AI  yielded  a  distribu¬ 
tion  of  responses,  preferences  for  fast  and  medium  FM 
sweeps,  respectively.  In  contrast  to  other  maps,  the  ma¬ 
jority  of  sites  in  the  ventral  aspect  of  this  map  were  char¬ 
acterized  by  preferences  for  medium  rates  of  FM  sweeps, 
despite  the  fact  that  the  dorsoventral  extent  explored  in 
this  case  (5  mm)  was  comparable  with  most  other  cases. 

The  preferred  FM  speed  distribution  shown  in  Fig.  8C 
(87-129;  3.2  kHz  <  CF  <  19.3  kHz)  is  similar  to  the 
distribution  illustrated  in  Fig.  8A  in  that  only  the  vetitral 
representation  of  FM  speed  responses  appKtarcd  to  vary 
with  CF.  Those  recording  sites  where  neural  response 
preferences  for  fast  speeds  were  cticountered  were  clso 
marked  by  CFs  between  7.0  and  9.0  kHz,  while  lower  or 
higher  CFs  appeared  to  be  associated  with  preferences 
for  medium-speeds. 

In  Fig.  8D  (87-001)  the  oneriation  of  the  mapped  area 
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was  tilted  24°  relative  to  the  isofrequeucy  oiientation  (see 
Fig.  4  in  Schreiner  and  Mendelson  1990),  resulting  in  an 
approximate  alignment  of  the  18.5-kHz  contour  with  the 
vcntiopcsiorior  to  dorsoanterior  axis  of  the  map.  Con¬ 
sidering  only  the  most  complete  isofrcqucncy  contours 
(18.5--22.0  kHz)  it  becomes  apparent  that  the  preferred 
FM  speed  followed  the  same  general  patteni  of  spatial 
distribution  (i.c.,  fasl-medium-slow  medium-fast)  seen  in 
most  other  cases. 

In  summary,  preferences  for  FM  sweeps  of  different 
speeds  seemed  to  be  systematically  distributed  along  the 
dorsoventral  axis  of  the  high-frequency  region  of  Al.  In 
the  most  dorsal  aspects  of  Al,  the  majoiity  of  lecording 
sites  were  distinguished  by  responses  showing  a  prefer¬ 
ence  for  fast  and/or  medium  FM  sweeps.  In  this  region, 
appro.timately  equal  respon.'^es  to  both  fast  and  medium 
FM  sweeps  were  also  encountered.  If  one  considers  the 
progression  of  FM  speed  responses  encountered  in  elec¬ 
trode  penetrations  running  along  the  dursoventrul  axis  of 
Al,  responses  alternated  in  the  following  way:  first,  re¬ 
sponses  encountered  consisted  of  preferences  for  fast 
speeds,  then  medium-speeds,  and  then  slow  speeds,  fol¬ 
lowed  by  responses  that  preferred  medium  speeds  again, 
until  in  the  most  ventral  region  there  was  u  tendency  for 
multiunits  to  prefer  fust  FM  sweeps.  In  the  more  ventral 
aspect  of  Al  (and  (lossibly  including  the  dorsal  region  of 
AH),  more  variability  between  maps  wa.s  ob.scrvcd.  In  the 
majority  of  cases,  this  .spatial  pattern  of  preferred  speed 
distribution  ap^ieared  to  be  oriented  orthogonally  to  the 
isoficqueney  domain  oi  Al  and  co  jlc  he  a'nsidcrcd  to  be 
largely  independent  of  CF. 


Rapresentation  of  direction  selectivity 

The  distribution  of  preferred  responses  to  each  direction 
of  FM  sweep  for  individual  cases  and  for  all  case.s  com¬ 
bined  is  shown  in  Table  2.  In  general,  at  ju.sl  o/cr  half  of 
I  the  cortical  locations  examined  (55,4%),  no  preference 

I  for  the  direction  of  FM  sweep  was  observed  (using  a 

criterion  of  ±0.15;  sec  Materials  and  methods).  In  one 
case  (87-129),  almost  75%  of  the  recording  sites  con¬ 
tained  responses  classified  as  non-directior.-selectivc.  Of 
the  total  number  of  recording  sites  for  all  casc.s  where 
direction  selective  responses  were  found,  almost  twice  as 
I  many  preferred  downward-  to  upward-directed  FM 

sweeps.  For  two  out  of  six  cases  (87-706  and  87-129),  the 
reverse  trend  was  observed,  consisting  of  a  slightly  higher 
percentage  of  responses  preferring  upward-  as  c-pposed 
to  downward-directed  FM  .sweeps. 

Figure  9  shows  a  scattc  plot  of  direction  seltctiviiy  as 
a  function  of  CF  for  two  cases.  The  two  dotted  lines, 
corresponding  to  ±0.15  (sec  Materials  and  methods), 
s  represent  the  chosen  classification  boundaries  lor  direc¬ 

tion-selective  rcspoa.scs.  Thus,  points  falling  between 
these  lines  were  classified  as  nou-dircction-selective  while 
those  fulling  outside  were  classified  as  direction  selective 
(negative  numbers  represent  downward -^lircctcd  re¬ 
sponses;  positive  numbers  rcprc.scnt  upward-dircctfid  re¬ 
sponses).  While  the  range  of  CFs  tested  for  case  87-518 
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Fi'Z.  Scatter  plots  of  dirccuun  selectivity  ui  u  function  of 
clinritciettsiiu  frequency  (CF).  AU  of  the  dircctloii-selcinive  index 
vuides  are  plotted  as  a  function  of  CF  for  u4k  87-.S18  (A)  and  87-001 
(B).  yositive  values  refer  to  a  preference  for  upward-directed  FM 
sweeps,  while  the  negative  Mlues  refer  to  a  preference  for  down¬ 
ward-directed  (’M  sweept.  Tlie  dotted  line.e  indicate  the  boundaries 
(±0.1C)  for  direction-selectivity  ctassiriciuion.  Symbols  between  the 
dotted  lines  rcpieseni  unit  dusters  that  were  non-direction-sclcctive. 
In  Heneral.  the  dcu>ce  of  direction  selectivity  up[>eared  to  be  inde¬ 
pendent  of  CF 


r*bk  2.  Distribution  of  preferred  direction  of  frcqucncy-moduiatcd 
sweeps 


Cat 

Number 
of  sweeps 

Upward 

(Vi) 

Downward 

(%) 

No  preference 
(•/.) 

86-697 

88 

9.1. 

42.0 

48.0 

87-420 

63 

15.9 

27.0 

57.1 

87-518 

82 

il.5 

50.0 

42.5 

87.706 

86 

23.3 

17.4 

59.3 

87-12y 

81 

14.8 

11.1 

74.1 

87-001 

96 

20.8 

26.0 

53.1 

Totai 

496 

15.5 

29.0 

55.4 

75 


Fig.  10.  llie  jpatial  distribution  of  direction-selective 
responses.  As  with  the  pseudo  three-dimensional  plots 
depicting  pnderred  speed,  in  the  upper  portion  of  the 
Tigur:  two  dimensions  of  the  projection  represent  the 
anteroposterior  (a,p)  and  dorsoventral  (d,i>)  extent  of 
the  cortical  suifacc.  The  arro'wheads  along  the  antero¬ 
posterior  dimension  indicate  the  approximate  orienta¬ 
tion  of  the  isofiequency  contours.  The  elevation  of  the 
surface  of  the  plot  corresponds  to  the  direction  selec¬ 
tivity  Bctoss  the  cortical  surface.  An  additional 
smoothing  factor  of  0.9  was  applied  to  the  data.  The 
numbers  along  the  elevational  axis  refer  to  the  direc¬ 
tion-selective  index  values  for  this  case.  The  negative 
number  and  corresponding  low  elevations  represent  a 
preference  for  downward-directed  FM  sweeps.  The  pos¬ 
itive  number  and  corresponding  pcaVs  represent  a  pref¬ 
erence  for  upward-directed  FM  sweeps.  The  dotted  line 
represents  a  direction-selective  index  vame  of  0.  In  the 
lower  portion,  the  numbers  In  the  box  represent  the  ac¬ 
tual  diroction-selecttve  index  values  obtained  for  each 
location.  For  this  cose,  cells  that  responded  best  to  up¬ 
ward-directed  FM  sweeps  weie  located  in  the  dorsal 
and  the  ventral  regions,  while  downward-directed  se¬ 
lective  cells  were  located  in  the  central  region 


fFig.  9A)  v/as  much  narrower  than  for  case  87-0()l  (Fig. 
9B).  both  scatter  plots  suggest  that  direction  selectivity 
was  independent  of  CF. 

In  general,  it  was  found  that  preferences  for  one  FM 
sweep  direction  were  spatially  segregated  from  responses 
for  the  opposite  direction.  Figure  10  (cu.se  87-420)  shows 
an  example  of  the  spatial  distribution  oi  direction  selec¬ 
tivity  across  Al.  As  in  the  pseudo  three-dimensional  plots 
for  preferred  FM  speed  (Figs,  (i-8),  the  X-  and  Y-axes 
represent  the  anteroposterior  and  dorsoventral  extent  of 
Al,  respectively.  In  addition,  surface  elevation  (Z-axis) 
represents  the  direction-selective  index  value  obtained  at 
each  cortical  location  examined  within  this  area.  The 
dotteil  line  represents  a  direction-selective  index  value  of 
0.  Below  the  pseudo  three-dimensional  plot  is  a  box  rep¬ 
resenting  the  cortical  locations  sampled.  The  numbers  in 
the  box  indicate  the  direction-selective  index  value  com¬ 
puted  for  responses  observed  at  each  location  (negative 
values  represent  a  preference  for  downward-directed  FM 
sweeps  and  positive  values  represent  a  preference  for  up¬ 
ward-directed  FM  sweeps).  In  Fig.  10,  neurons  located  in 
the  dorsal  region  showed  little  or  no  direction  selectivity. 
Responses  recorded  around  the  10  kHz  region  displayed 
a  weak  preference  for  upward-directed  FM  sweeps.  In  the 
mid-region,  a  higher  incidence  of  selectivity  for  down¬ 
ward-directed  sweeps  was  observed,  while  in  the  most 
ventral  aspect  oi  this  map,  most  units  exhibited  a  prefer¬ 
ence  for  upward-directed  FM  sweeps. 


Figure  1 1  shows  three  additional  examples  of  the  spa¬ 
tial  distribution  of  direction  selective  responses  across 
Al.  Figure  11 A  (87-518)  shows  a  map  in  which  prefer¬ 
ences  for  upward-directed  FM  sweeps  were  confined  ex¬ 
clusively  to  the  dorsal  region  of  Al.  In  the  ventral  regii  .1, 
most  responses  were  non-dircction-sclective.  In  the  mid¬ 
dle  region,  preferences  for  downward-directed  FM 
sweeps  were  prevalent.  For  this  example,  direction-  and 
non-dircction-selcctivc  responses  appeared  to  be  inde¬ 
pendent  of  CF- 

Thc  map  illastratcd  in  Fig.  IIB  (86-697)  is  unique  in 
that  there  were  three  distinct  regions  of  downward-direc¬ 
tion  selective  FM  responses  and  three  regions  distin¬ 
guished  by  a  concentration  of  upward  direction-selective 
responses.  Further,  the  range  of  direction-selective  index 
values  (-0.82  to  -i- 1.0)  was  slightly  greater  than  for  other 
cases.  In  the  dorsal  and  mid-regions  of  the  surface  map, 
direction  selectivity  appeared  to  be  dependent  upon  CF. 
Upward-directed  responses  were  found  at  higher  CFs 
(around  13.0-15.0  kHz)  in  the  dorsal  region  and  at  lower 
frequencies  (8.0-10.0  kHz)  in  the  middle  region. 

In  the  dorsal  region  of  case  87-001  (Fig.  1 1C)  cells  with 
lower  CFs  responded  best  to  upward-directed  FM 
sweeps,  while  cells  with  higher  CFs  responded  best  to 
downward-directed  FM  sweeps.  In  the  mid-region  of  the 
naap,  responses  were  strongly  selective  for  downward-di¬ 
rected  FM  sweeps  and  were  largely  independent  of  CF 
In  the  vcntrpl  aspect  of  the  map,  a  preference  for  down- 
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ward-dircctcd  FM  sweeps  was  also  apparent,  particular¬ 
ly  for  units  with  CFs  between  15.0  and  26.3  kHz,  al¬ 
though  the  degree  of  direction  selectivity  was  reduced 
relative  to  the  middle  region.  Low-frequency  sensitive  re¬ 
sponses  encountered  in  the  ventral  posterior  region  of 
this  map  were  not  selective  for  FM  sweep  direction. 

A  comparison  of  all  the  cases  shown  in  Figs,  10  and  1 1 
suggests  that  the  spatial  organization  of  direction-selec¬ 
tive  FM  responses  may  obey  similar  rules.  However,  the 
variability  in  the  global  representation  of  these  response 
properties  appears  to  be  greater  than  was  observed  for 
FM  sweep  speed. 

In  summary,  examination  of  dirct-iional  preferences 
for  FM  sweeps  over  a  wide  area  of  i  ortex  showed  evi¬ 
dence  of  a  systematic  distribution  along  the  dorsovcntral 
axis  of  the  high-frequency  region  of  AI,  in  the  dorsal 
region  of  AI,  most  direction-selective  response  prefer- 


15.2  kHz 


Fig.  lU-D.  Additional  examples  of  the  spatial  distribution  of  di¬ 
rection  selectivity.  A  Pseudo  three-  dimensional  representation  of 
direction  selectivity  for  cate  87-518.  B  Spatial  distribution  of  di¬ 
rection  selectivity  for  case  86-697  represented  as  a  pseudo  three- 
dimensional  projection.  C  Pseudo  three-dimension^  representa¬ 
tion  of  dircaion  selectivity  for  case  87-001.  Mote  that  the  orienta¬ 
tion  of  the  mapped  area  for  this  case  was  tilted  24'*  relative  to  the 
isofrequency  orientation.  All  three  cases  showed  a  nonrandom 
spatial  distribution  of  direction-selective  FM  sweep  responses. 
Conventions  are  identical  to  those  in  Fig.  10 


cnccs  occurred  for  upward-directed  FM  sweetps,  while  in 
the  middle  region  the  majority  of  responses  showed 
strong  preferences  for  downward-directed  FM  sweeps. 
These  directional  preferences  generally  appeared  to  be 
independent  of  CF,  although  some  non-uniformity  in  the 
distribution  along  the  frequency  gradient  was  evident. 
Toward  the  ventral  aspect  of  AI,  the  preference  for  down¬ 
ward-directed  sweeps  gradually  declined  leading  to  either 
no  direction  selectivity  or  to  a  preference  for  upward-di¬ 
rected  FM  sweeps.  In  some  ventral  locations,  the  direc¬ 
tional  preference  seemed  to  be  dependent  upon  CF. 


Direction  selectivity  as  a  function  of  rate  of  FM  sweep 


In  the  previous  section,  di'  ection  selectivity  was  associat¬ 
ed  with  maximal  respon:.(.s  (see  Materials  and  methods) 
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and  thus  was  considered  without  specific  reference  to  the 
possible  effects  FM  sweep  speed  might  play  in  the  gener¬ 
ation  of  direction-selective  responses.  In  this  section,  the 
distribution  of  direction-selective  responses  is  re-exam- 
iacd  using  three  different  speeds  of  FM  sweep.  Thus,  the 
contribution  of  sweep  speed  to  direction  selectivity  of 
FM  sweep  could  be  estimated  by  calculating  independent 
direction-selective  index  values  for  each  set  of  responses 
to  fast,  medium,  and  slow  FM  sweeps  for  each  case. 

The  spatial  distribution  of  direction-selective  respons¬ 
es  illustrated  in  Fig.  12  (87-420)  was  obtained  using  the 
same  speed  conditions  that  were  used  to  generate  the 
results  shown  in  Fig.  10.  At  medium  and  fast  speeds,  the 


Fig.  12.  Spatial  distribution  of  direction  selectivity  as  a  function 
of  speed  of  FM  jweep;  the  pseudo  three-dimensional  representa¬ 
tions  of  direction  selectivity  as  determined  for  each  of  the  three 
sweep  speeds  used  in  this  study.  Independent  calculations  of  the 
direction-selectivity  index  were  made  for  responses  to  fast,  medi¬ 
um,  and  slow  FM  sweeps.  These  spatial  distributions  of  direc¬ 
tion-selective  responses  are  the  individual  speed  conditions  of  the 
case  based  on  the  strongest  response  shown  in  Fig.  10.  The 
luiijest  deviation  from  the  spatial  distribution  obtained  with  the 
strongest  response  occurred  with  slow  FM  sweeps.  Conventions 
arc  identical  to  those  in  Fi.i^  10 


more  common  biniodal  distribution  of  upward  direction- 
selective  responses  was  apparent.  For  the  medium-speed, 
this  prcfcrccoe  for  upward-directed  FM  sweeps  was  more 
frequently  observed  in  the  dorsal  and  especially  in  the 
ventral  region  'vhen  compared  with  the  distribution  of 
direction  selectivity  based  on  the  strongest  response 
(4-0.82  in  the  medium-speed  plot  vs  4-0.49  in  the 
strongest-respoase  plot),  lite  cortical  distribution  of  di¬ 
rection-selective  responses  assessed  with  fast  FM  speeds 
aiso  differed  from  the  distribution  based  on  the  strongest- 
response  criterion  in  the  dorsal  region,  where  direction- 
selective  respons^ts  were  less  frequently  encountered.  The 
preferences  for  downward-directed  FM  sv^eeps  concen- 


78 


«B7'00I 


0  40.59 


o  0.0 


-  -O.OS 


trated  in  the  mid-region  of  AI,  as  determined  using  both 
medium  and  last  FM  sweep  rates,  was  similar  to  the  dis¬ 
tribution  based  on  the  strongest  response.  The  distribu¬ 
tion  of  direction-selective  responses  for  slow  FM  sweeps 
differed  from  the  strongest  response  distribution  in  three 
respects.  First,  relatively  few  multiunit  responses  at  any 
site  showed  a  preference  for  upward-directed  sweeps. 
Second,  responses  at  sites  in  the  dorsal  region  where  re¬ 
sponses  displayed  CFs  ranging  from  11.0  to  12.0  kHz 
also  exhibited  strong  preferences  for  downward-directed 
FM  sweeps.  Third,  the  magnitude  of  direction  selectivity 
was  reduced  at  some  of  the  same  locations  where  re¬ 
sponses  showed  marked  preferences  for  fast  and  medium 
speeds. 

Figure  13  (87-001)  shows  another  example  of  the  dis- 
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Fig.  13.  A  second  example  of  the  spatial  distdbution  of  ditcction 
selectivity  as  a  function  of  speed  of  FM  sweep.  The  three  pseudo 
three-dimensional  representations  of  direction  selectivity  based  on 
the  individual  speeds  are  for  the  strongest  response  case  shown  in 
Fig.  1 1C  Note  that  the  orienution  of  the  mapped  area  for  this 
case  was  tilted  24"  relative  to  the  isofrequency  orientation,  result¬ 
ing  in  an  approximate  alignment  of  the  IS.S-kHz  contour  with 
the  vcntropostcrior  to  dorsoantcrior  axis  of  the  map.  Ttie  spatial 
distribution  of  direction  selectivity  obtained  with  slow  FM 
sweeps  differed  the  most  from  that  obtained  with  the  suongest 
response.  For  conventions  see  Fig.  10 


tribution  of  direction-selective  responses  as  a  funaion  of 
the  individual  FM  speed  conditions  (see  Fig.  1  iC).  Of  the 
three  speeds  examined,  the  distribution  of  direction-se¬ 
lective  responses  tested  with  fast  FM  sweeps  more  closely 
resembled  the  distribution  of  direction  sdectivity  based 
on  the  strongest  response.  The  plots  for  medium  and 
slow  FM  sweeps  differed  most  in  the  ventral  region  from 
the  direction  selectivity  distribution  of  the  strongest  re¬ 
sponse.  As  with  the  case  shown  in  Fig.  12,  the  distribu¬ 
tion  of  direction-selective  responses  tested  with  slow  FM 
sweeps  differed  most  from  the  other  two  FM  speed  distri¬ 
butions  as  well  as  from  that  derived  from  the  strongest 
response. 

In  summary,  the  spatial  distribution  of  direction-se¬ 
lective  r(5sponses  to  FM  sweeps  depended  on  the  sweep 
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Fig.  UA.B.  Comparison  of  direction  selectivi¬ 
ty  {DS]  and  preferred  speed  {PS).  The  contour 
plots  on  the  left  represent  the  spatial  distribu¬ 
tion  of  direction  .selectivity.  Tlie  hatching  de¬ 
picts  those  cortical  regions  where  units  re- 
s|>onded  best  to  downward-directed  FM 
sweeps  (DS  index  <-0.16  and  DS  index 
<-0.2,  respectively).  The  contour  plots  on  the 
right  represent  the  .spatial  distribution  of  the 
preferr^  FM  speed.  The  hatching  depicts 
those  cortical  areas  where  locations  preferred 
fast  FM  sweep  speeds  (PS  index  >2.2  or  2.0, 
res|5ectively).  The  two  dimensions  of  the  plot 
represent  the  anteroposterior  (a,p)  and 
dorsoventml  (d.v)  extent  of  the  cortical  surface. 
A  Direction-selective  and  preferred-speed  con¬ 
tour  plots  for  case  87-518.  The  ilasheJ  line  de¬ 
picts  an  isofrequency  contour  of  10.0  kHz.  B 
Direction-selective  and  preferred-speed  con¬ 
tour  plots  for  case  87-001.  The  isofrequency 
contour  for  18.0  kHz  is  represented  by  the 
liashed  line.  The  orientation  of  the  mapped 
area  for  this  case  was  tilted  24°  relative  to  the 
isofrcquency  orientation.  Note  that  the  nearly 
complementary  distribution  of  locations  with 
preference  for  downward-directed  sweeps  and 
fast  sweep  rates  was  observed  in  both  cases 


Kpucii  usctl  to  elicit  tlic  response  und  dirTcred  somewhat 
from  the  distribution  based  on  the  strongest  response. 
Direction-selective  response  distributions  determined 
with  fast  und  medium  rates  of  FM  sweeps  showed  closer 
agreement  with  those  based  on  the  strongest  response 
criterion,  especially  in  the  dorsal  and  middle  regions  of 
Al.  However,  the  greatest  dirference  between  the  spatial 
distributions  of  direction-selective  responses  assessed  us¬ 
ing  these  criteria  was  observed  in  the  case  of  slow  FM 
sweeps.  Particularly  notable  in  the  slow  FM  speed  map 
was  a  reduction  in  the  degree  of  direction  selectivity  for 
the  more  centrally  located  responses  compared  with  the 
strong,  downward  direction-selective  responses  that  oc¬ 
cupy  this  region  of  the  map  when  faster  FM  speeds  arc 
used. 


Correlation  of  direction  selectivity  and  preferred  speed 

An  attempt  was  made  to  correlate  the  spatial  distribution 
of  preferred  speed  with  that  of  direction  selectivity.  The 
contour  plots  in  Fig.  14  illustrate  the  relationship  be¬ 
tween  the  spatial  distributions  of  direction-selective  and 
preferred-speed  responses  for  two  cases.  The  dashed  line 
represents  the  orientation  of  an  isofrcquency  contour 
running  through  the  center  of  each  map.  In  the  two  cases 
illustrated,  87-518  (Fig.  14 A)  and  87-001  (Fig.  14B),  the 
contour  plots  on  the  left  of  each  panel  show  the  spatial 
distribution  of  direction-selective  responses  while  the 
plots  on  the  right  show  the  .spatial  distribution  of  pre¬ 
ferred  FM  speed  responses.  The  shaded  areas  in  the  con¬ 
tour  plots  for  direction  selectivity  (DS)  demarcate  those 
regions  where  multiunit  responses  reflect  a  preference  for 
downward-directed  FM  sweeps  (DS  <-0.16  and  DS 
<-0.2,  respectively)  while  the  shaded  areas  in  the  pre¬ 


ferred  FM  speed  (PS)  plots  illustrate  regions  where  the 
best  response  was  to  fust  FM  sweeps  (PS  >2.2  and  PS 
>  2.0,  respectively).  As  can  be  seen  in  Figs.  14A  and  D, 
there  wus  little  overlap  between  the  areas  containing 
downward-  direction-selective  and  fast  speed  responses 
to  FM  sweeps  for  the  two  cases.  This  observation  sup¬ 
ports  the  notion  of  u  covanunce  of  direction  selectivity 
and  preferred  speed:  recording  sites  in  which  multiunit 
responses  prefer  fast  FM  sweeps  coincide  with  locations 
of  multiunit  responses  preferring  upward-directed  FM 
sweeps  and  vice  versa.  However,  significant  deviation 
from  this  pattern  may  be  seen  in  the  upper  third  and  in 
the  ventral  extremes  of  the  mapped  areas.  This  suggests 
that  the  covariance  between  direction  and  speed  selectiv¬ 
ity  may  not  be  homogeneously  distributed  across  the  en¬ 
tire  cortical  area  examined  by  these  studies.  This  is  also 
supported  by  the  results  of  a  point-by-point  correlational 
analysis  of  direction  and  speed  selectivity.  Table  3  shows 
that  no  statistically  significant  correlation  could  be  found 
between  direction  and  speed  selectivity  across  all  points 
in  each  map.  Regional  correlations  for  preferred  speed 
and  direction  were  carried  out  independently  in  the  dor- 


T»blc  3,  Corrdalion  of  preferred  speed  and  preferred  direction  of 
frequency-modulated  sweep 


Cal 

Number  of 
sweeps 

Correlation  P 

(r) 

86-697 

88 

0.07 

0.51 

87-420 

63 

0.04 

0.78 

87-518 

81 

0.08 

0.49 

87-706 

83 

0.02 

0.85 

87-129 

81 

0.20 

0.07 

87-001 

96 

0.12 

0.25 

80 


sal  (i.e.,  the  portion  of  AI  dorsal  to  the  most  sharply 
tuned  multiple  unit  responses;  see  Schreiner  and  Mendel- 
son  1990)  and  ventral  (i.e.,  locations  ventral  to  the  most 
sharply  tuned  region  of  AI)  subregions  of  each  map  also 
yielded  no  statistically  signiGcant  correlations  between 
these  response  properties. 


Correlation  of  direction  selectivity 

and  preferred  speed  with  excitatory  bandwidth 

and  broadband  transient  response 

In  general,  there  was  a  tendency  for  multiunit  responses 
recorded  in  the  more  dorsal  and  ventral  regions  of  AI  to 
respond  best  to  fast  or  medium  FM  sweeps  changing  in 
an  upward  direction,  while  multiunits  examined  in  the 
middle  region  responded  best  to  downward*directed  and 
slower  FM  sweeps.  In  u  previous  paper,  Schreiner  and 
Mendelson  (199Uj  showed  that  in  the  dorsal  and  ventral 
regions  of  AI,  cells  had  low  Qiodu  and  Q40du  values  and 
responded  best  to  broadband  transient  stimuli.  Cells  lo¬ 
cated  in  the  middle  region  were,  in  contrast,  narrowly 
tuned  and  responded  poorly  to  broadband  stimuli.  Thus, 
in  an  attempt  to  reveal  the  neural  mechanisms  underly¬ 
ing  FM  sweep  responses,  the  relationships  between  exci¬ 
tatory  bandwidth  and  broadband  transient  response 
with  preferred  P'M  sweep  speed  and  direction  were  exam¬ 
ined.  To  this  end,  statistical  correlations  between  these 
response  parameters  were  analyzed  separately  for  cells 
located  in  the  dorsal  region  (Aid)  and  the  ventral  region 
(AIv)  of  the  auditory  cortex.  In  addition,  statistical  corre¬ 
lations  were  also  examined  for  responses  sampled  over 


Table  4.  CorreUUon  of  nreferred  frequency-modulated  speed  with 
integrated  excitatory  bandwidth  and  response  strength  to  broad¬ 
band  transient  signals 


Case 

AI 

Aid* 

Alv*’ 

r 

P 

r 

P 

r 

P 

QtOJU 

87-518 

-0.26 

0.03 

-0.38 

0.01 

87-706 

-0.20 

0.05 

87-001 

-0.24 

0.02 

-0.31 

0.02 

Q«dll 

87-518 

-0.56 

O.OOOl 

-0.50 

O.OOl 

-0.58 

0.0002 

87-001 

-0.29 

0.005 

-0.29 

0.02 

BTS 

87-518 

0.47 

0.0001 

0.58 

0.0001 

0.40 

0.01 

87-706 

0.22 

0.05 

87-001 

0.45 

0.0001 

0.44 

0.0005 

0.44 

0.008 

Linear  regression  analysis;  r,  correlation  coeiricient;  P,  level  of  signi¬ 
ficance  (f-test) 

*  Aid  represents  recording  locations  in  the  portion  of  AI  dorsal  to 
the  most  sharply  tuned  multiple  unit  responses 
•’  Alv  repre.scnts  locations  ventral  to  the  most  sharply  tuned  region 
of  AI  (Schreiner  and  Mendelson  1990) 

BTS.  broadband  transient  signals:  AI.  primary  auditory  cortex;  Q, 
excitatory  bandwidth 


Tabic  5.  Correlation  of  direction  selectivity  for  frequency-modula¬ 
ted  sweep  and  integrated  cxdtatoiy  bandwidth 


Case 

A! 

Aid 

Alv 

r 

P 

r 

P 

r 

P 

Qioua 

87-518 

-0.25 

0.025 

-0.33 

0.03 

87-706 

-0.30 

0.005 

-0.36 

0.01 

87-001 

-0.33 

0.001 

-0.32 

0.01 

-0.38 

0.002 

Q«>dB 

87-518 

-0.49 

0.0001 

-0,36 

0.02 

-0.57 

0.0003 

87-001 

-0.29 

0.004 

-0,32 

0.01 

Linear  regression  analysis:  r,  conelation  coeiTicient;  P,  level  of  signi¬ 
ficance  (F-test)..  For  further  explanation  sec  Table  4 


the  entire  cortical  region  (AI)  in  the  three  cases  for  which 
complete  Qk^b  Qsoub  available. 

Tlie  correlation  of  integrated  excitatory  bandwidth 
and  broadband  transient  response  with  preferred  FM 
speed  is  shown  in  Table  4.  Only  statistically  significant 
correlations  have  been  indicated.  As  can  be  seen,  there 
was  a  strong  correlation  between  preferred  FM  speed 
and  excitatory  bandwidth  across  the  entire  region  of  cor¬ 
tex  examined  in  these  experiments.  This  correlation  indi¬ 
cates  that  units  with  higher  Q,(ub  and  Q40JB  (i-c»  units 
that  were  narrowly  tuned)  respond  best  to  slower  FM 
sweeps  and  vice  versa.  When  examined  as  a  function  of 
cortical  subregioiL,  there  was  a  stronger  correlation  be¬ 
tween  these  response  parameters  and  preferred  FM  speed 
in  the  dorsal  than  in  the  ventral  subregion.  For  broad¬ 
band  stimulus  response  and  preferred  FM  sweep  speed,  a 
strong  correlation  was  also  obsei-ved  in  both  halves  of  the 
cortical  regions  studied.  Thus,  units  that  respemded  best 
to  faster  speeds  of  FM  sweeps  also  seemed  to  respond 
well  to  u  broadband  transient  stimulus  and  vice  versa.  In 
all  three  cases,  there  was  u  significant  correlation  between 
FM  direction  selectivity  and  integrated  excitatory  band¬ 
width  over  the  entire  extent  of  the  mapped  cortical  region 
(Tabic  5).  In  particular,  significant  correlations  wem  ob¬ 
served  for  multiunit  responses  obtained  in  the  dorsal 
rather  than  the  ventral  region  of  the  cortex.  Unlike  pre¬ 
ferred  speed  of  FM  sweep,  there  was  no  significant  corre¬ 
lation  between  direction  selectivity  and  responses  to 
broadband  transient  signals. 


Discussion 

The  primary  purpose  of  the  present  study  was  to  deter¬ 
mine  the  systematic  spatial  distribution  of  neural  re¬ 
sponses  in  the  cat  AI  to  the  direction  and  speed  of  FM 
sweeps.  For  these  two  FM  sweep  parameters,  a  nonran¬ 
dom  distribution  along  the  isofrequency  dimension  was 
observed.  Extensive  mapping  of  area  AI  revealed  that 
response  preferences  to  fast  and/or  medium  FM  speeds 
were  usually  encountered  dorsally,  slow  FM  spced-sclcc- 
tive  responses  were  found  centrally,  and  fast  speed-selec¬ 
tive  responses  were  located  vcntrally  within  AI.  Medium- 
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speed  selective  responses  were  juxtaposed  between  the 
fast  and  slow  speed  selective  regions,  supporting  the  con¬ 
clusion  that  there  is  an  orderly,  functional  transition  in 
speed  tuning  along  the  iscfrequency  domain  of  AI.  For 
most  direction-selective  responses,  preferences  for  up¬ 
ward-directed  FM  sweeps  were  found  in  dorsal  and,  to  a 
lesser  degree,  in  ventral  subregions  of  AI  as  well,  while 
preferences  for  downward-directed  FM  sweeps  were 
found  in  the  central  region  of  AI.  Although  the  size  and 
extent  of  speed-  and  direction-selective  response  regions 
varied  to  some  extent  across  cases,  the  overall  distribu¬ 
tion  of  these  response  parameters  within  AI  suggests  that 
in  addition  to  tonotopicity,  the  functional  organization 
of  AI  may  also  be  defined  by  an  orthogonal  dimension  of 
FM  sweep  responses  within  the  isofrcqucncy  domain. 

Before  discussing  these  results  further,  two  prelimi¬ 
nary  issues  should  be  addressed.  First,  since  multiunit 
recording  samples  the  activity  of  several  neurons  within  a 
small  volume,  it  cannot  be  assumed  that  all  neurons  in¬ 
volved  have  identical  response  properties  (Schreiner  and 
Mcndelson  1990).  Furthermore,  the  contribution  of  tha- 
lumocoriicul  afferents  to  local  cortical  responses,  al¬ 
though  probably  minimal,  cannot  be  completely  exclud¬ 
ed  (Schreiner  and  Cynader  1984).  Therefore,  cortical  mul¬ 
tiunit  responses  should  be  interpreted  as  a  local  extracel¬ 
lular  manifestation  of  integrated  cortical  processes  oper¬ 
ating  at  a  given  location  rather  than  a.s  a  faithful  reflec¬ 
tion  of  underlying  single-unit  responses  (Schreiner  and 
Mendelson  1990).  Accordingly,  both  direction-selective 
and  speed-selective  multiunit  responses  recorded  at  a 
given  cortical  site  reflect  the  dominant  pattern  of  neural 
activity  resulting  from  local  cortical  processing  at  that 
location.  The  systematic  variation  in  FM-selcctivc  multi¬ 
unit  responses  observed  along  the  dorsoventral  extent  of 
AI  is  inon  easily  explained  as  the  consequence  of  intrin¬ 
sically  distributed  cortical  processes  rather  than  as  the 
result  of  random  variation  in  multiunit  responses.  There¬ 
fore,  because  FM-sclectivc  responses  varied  systematical¬ 
ly  as  a  function  of  cortical  location,  it  is  more  likely  that 
these  variations  reflect  the  global  operation  of  extensive 
cortical  networks,  rather  than  location-independent,  ran¬ 
dom  variations  in  mulliumt  activity  (Schreiner  and 
Mcndelson  1990). 

The  second  issue  is  concerned  with  the  possible  sys¬ 
tematic  influences  of  the  stimulus  delivery  system.  Irregu¬ 
larities  in  the  stimulus  delivery  system  may  create  poten¬ 
tial  contributions  to  the  recorded  responses  such  as,  for 
example,  AM  components  in  the  delivered  signal.  Given 
the  broad  FM  .sweeps  used  in  the  present  study  (from 
0.25  to  64.0  kHz)  and  the  known  irregularities  in  the 
transfer  function  of  the  sound  delivery  system  (see  Mate¬ 
rials  and  methods),  it  is  likely  that  the  FM  sweeps  used  in 
these  experiments  also  contained  an  AM  component,  es¬ 
pecially  at  higher  frequencies.  However,  as  pointed  out 
by  Suga  (1968),  the  effect  of  the  potentially  confounding 
amplitude  component  on  the  neural  responses  encoun¬ 
tered  is  likely  to  be  minimal,  since  the  possible  AM  con¬ 
tributions  would  be  similar  for  most  stimuli  (eg.,  increas¬ 
ing  AM  component  with  a  downward-directed  FM 
sweep  and  decreasing  AM  component  with  an  upward- 
directed  FM  sweep),  while  neural  responses  to  FM  stim¬ 


uli,  observed  in  the  present  study,  systematically  differed 
from  one  another. 

Furthermore,  if  our  FM  data  were  greatly  confounded 
by  an  AM  component,  then,  based  on  the  results  of 
Phillips  and  Hall  (1987),  one  would  predict  that  the  ma¬ 
jority  of  cortical  cells  would  respond  most  vigorously  to 
downward-directed  FM  sweeps.  This  prediction  is  based 
on  the  premise  that  the  high-frequency  roll-off  in  the 
transfer  function  of  most  sound  delivery  systems  implies 
that  downward-directed  FM  sweeps  v/ill  occasion  a  stim¬ 
ulus  that  increases  progressively  in  amplitude  as  it  de¬ 
creases  in  frequency.  Phillips  and  Hall  (1987)  showed 
that,  with  the  exception  of  cells  with  off  responses,  most 
cells  (87%)  responded  to  increases  in  stimulus  intensity. 
However,  55%  of  the  unit  clusters  sampled  in  the  present 
study  did  not  respond  selectively  to  either  direction  of 
FM  sweep.  In  addition,  of  the  direction-selective  multi¬ 
unit  responses,  some  preferred  upward-directed  while 
others  preferred  downward-directed  FM  sweeps.  Thus, 
these  considerations  strongly  suggest  that  nonspectral 
contributions  to  the  FM-selcctive  responses  reported 
here  were  minimal  and,  tnerefore,  do  not  seriously  com¬ 
promise  the  interpretation  or  the  generality  of  these  find¬ 
ings. 


Cortical  depth  and  FM  sweep  selectivity 

Cells  recorded  along  a  single  electrode  penetration  (ori¬ 
ented  perpendicular  to  the  cortical  surface)  displayed 
similar  preferences  for  the  speed  and  direction  of  FM 
sweeps.  This  was  clearest  at  depths  between  650  and  2000 
pm  below  the  cortical  surface.  Despite  the  relatively  small 
number  of  long  penetrations  investigated  in  the  present 
study,  the  similarity  of  responses  recorded  at  diverse 
depths  suggests  that  FM  speed-  and  direction-selective 
responses  arc  column-dependent  properties.  This  is  con¬ 
sistent  with  the  single-unit  study  of  Mcndelson  and  Cy- 
nadcr  (1985)  in  which  they  reported  that  cells  within  an 
electrode  penetration  perpendicular  to  the  cortical  sur¬ 
face  exhibited  similar  FM  sweep  speed  and  direction-se¬ 
lective  responses.  Furthermore,  other  response  properties 
such  as  CF,  binaural  interaction,  time-delay  sensitivity, 
and  threshold  have  also  been  found  to  exhibit  similar 
columnar-dependent  properties  (Suga  1965a;  Brugge  and 
Mcrzenich  1973;  Imig  and  Adrian  1977). 

Stimulus  level  and  FM  sweep  responses 

The  cfi'ecl  of  stimulus  level  on  preferred  speed  and  direc¬ 
tion  selectivity  was  statistically  significant.  This  is  per¬ 
haps  not  surprising  given  that  excitatory  bandwidth  tun¬ 
ing  is  also  known  to  be  affected  by  stimulus  level 
(Schreiner  and  Mcndelson  1990).  In  addition,  as  we  have 
demonstrated  here,  integrated  excitatory  bandwidth  is 
significantly  correlated  with  preferred  speed  and  direc¬ 
tion  selectivity  (sec  Tables  4,  5).  Although  significant,  the 
slopes  of  the  linear  regression  lines  litted  to  both  pre¬ 
ferred-speed  and  direction-selectivity  data  were  relatively 
shallow.  This  suggests  that  the  trend  toward  faster  speed 
selectivity  and  downward  direction  selectivity  for  FM 


sweeps  with  increasing  stimulus  level  was  minimal.  It 
should  be  noted  that  Moller  (1974)  also  observed  a  higher 
occurrence  of  downward-directed  cells  in  the  rat  cochlear 
nucleus  as  stimulus  intensity  was  increased. 


Preferred  speed 

The  cortical  distribution  of  preferred  speed  responses  ex¬ 
hibited  steep  gradients  along  the  dorsoventral  axis  of  AI. 
In  many  cases,  the  distribution  of  preferred  speed  re¬ 
sponses  was  largely  independent  of  CF,  especially  in  the 
dorsal  and  central  subregions  of  AI.  In  cases  where  pre¬ 
ferred  speed  varied  with  CF  (see  Fig.  8),  variability  was 
greatest  in  the  ventral  aspect  of  AI.  A  possible  explana¬ 
tion  for  this  variability  is  that  some  ventral  recording 
sites  may  actually  have  been  located  in  the  dorsal  asjject 
of  the  adjacent  auditory  field  (All).  Neurons  in  All  arc 
characterized  by  broad  tuning  curves  (i.e.,  with  small 
Qkwb  values;  sec  Schreiner  and  Cynader  1984;  Schreiner 
and  Mendelson  1990).  However,  the  transition  from  AI 
to  All  as  measured  by  the  progression  cf  Qioju  values  is 
extremely  gradual  such  that  the  distinction  between  these 
neighbouring  cortical  regions  may  be  nondetectablc 
(Schreiner  and  Cynader  1984).  Whatever  cortical  field 
they  occupied,  multiunit  responses  in  the  ventral  region 
appeared  to  be  functionally  distinct  from  responses  en¬ 
countered  at  more  dorsal  cortical  locations  in  AI.  Addi¬ 
tional  support  for  this  functional  distinction  is  derived 
from  the  stronger  correlations  observed  between  the  inte¬ 
grated  excitatory  bandwidth  and  preferred  FM  speed  in 
the  dorsal  as  compared  to  the  ventral  regions  of  AI. 

Although  the  three  FM  speeds  used  in  the  present 
study  were  not  sufficient  to  uniquely  determine  the  pre¬ 
ferred  FM  speed  for  each  location,  they  did  allow  a 
global  assessment  of  the  spatial  distribution  of  speed 
preferences  across  the  auditory  cortex.  Previous  studies 
(Mendelson  and  Grasse  1992)  of  FM  speed  tuning,  to¬ 
gether  with  single-unit  control  experiments  in  this  study 
(see  I-'ig.  4).  indicate  that  the  FM  speeds  selected  in  these 
experiments  were  sufficiently  spaced  to  safely  avoid  un- 
dcrsampling  speed-selective  responses.  However,  it  can¬ 
not  be  completely  ruled  out  that  some  of  variability  in  the 
data  could  be  due  to  ceils  or  cell  clusters  contributing  to 
the  sjjccd-tuning  responses  which  themselves  prefer 
speeds  that  fall  either  completely  outside,  or  somewhere 
in  between  the  actual  speed  conditions  used  in  this  study. 

The  spatial  distributions  of  preferred  speed  illustrated 
in  Figs.  6.  7A,  and  8  were  based  on  a  nonquantal  mea¬ 
surement  of  speed  preference.  The  advantage  of  this  ap¬ 
proach  was  that  it  incorporated  responses  to  all  three 
FM  speed  conditions.  However,  it  is  possible  that  this 
measurement  may  have  artificially  conferred  the  status  of 
a  continuum  on  the  distribution  of  FM  specd-seicctivc 
responses.  On  examination,  the  formula  used  for  this 
analysis  has  an  inherent  bias  that  assigns  a  prele.tiicr  for 
medium  FM  sweeps  to  some  responses  that  may  not  be 
speed  selective  at  all.  Yet,  it  is  unlike  ly  that  this  bias 
affected  the  essential  aspects  of  the  spatial  distnbu'ion  of 
preferred  speed  in  any  significant  way  for  the  following 
reasons.  First,  this  bias  only  affected  10%  of  all  the  co.'-ti- 


cal  responses  tested  (i.e.,  10%  of  all  units  in  this  study  had 
no  preferred  speed  as  determined  by  the  discrete  mea¬ 
surement).  Second,  based  on  the  nonquantal  measure¬ 
ment,  when  the  spatial  distribution  of  preferred  speed 
was  compared  with  that  based  on  the  discrete  speed  crite¬ 
rion,  a  close  agreement  was  observed  between  the  distri¬ 
butions  obtained  with  both  estimates  (see  Fig.  7). 

In  general,  the  largest  group  of  multiunit  responses 
displaying  a  unimodal  rate  tuning  profile,  responded  best 
to  fast  FM  sweeps,  the  second  largest  group  preferred 
medium  FM  sweeps,  and  the  smallest  group  preferred 
slow  FM  sweeps.  This  finding  is  remarkable  in  light  of 
the  frequency  transitions  that  have  been  measured  in  cat 
vocalizations  (Pick  1979).  It  has  been  shown  that  approx¬ 
imately  90%  of  the  transition  rates  of  frequency  glides  in 
cat  vocalizations  are  below  10  octaves  per  second.  This 
rate  corresponds  to  the  “slow"  FM  speed  used  in  the 
present  study.  Although  Pick  (1979)  may  not  have  chosen 
an  exhaustive  set  of  cat  vocalizations  for  this  analysis,  the 
available  data  suggest  that  FM  processing  at  the  level  of 
area  AI  of  the  cat  cortex  is  not  specialized  for  species- 
specific  vocalizations.  The  range  covered  by  the  preferred 
FM  speeds  reported  here  and  in  the  single-unit  studies  of 
Mendelson  and  colleagues  (Mendelson  and  Cynader 
1985;  Mendelson  and  Grasse  1992)  indicate  that  the  cor¬ 
tex  may  be  used  for  the  detection  of  frequency  transitions 
in  the  vocalizations  of  its  prey  (c.g.,  mice)  or  for  the  detec¬ 
tion  of  spectral  changes  useful  for  sound  localization.  The 
fact  that  many  cortical  responses  were  sensitive  to  the 
direction  of  FM  sweeps  suggests  that  neurons  were  in¬ 
deed  responding  to  the  rate  of  change  of  FM  sweeps 
rather  than  to  the  integrated  spectrum  of  frequency  com¬ 
ponents  covered  by  the  FM  sweep. 


Direction  selectivity 

In  general,  all  of  the  cases  examined  in  the  present  study 
showed  evidence  of  spatial  segregation  in  the  isofrequen- 
cy  domain  for  responses  to  cither  upward-  or  downward- 
directed  FM  sweeps.  The  dorsal  region  of  the  mapped 
areas  contained  units  that,  in  general,  responded  best  to 
upward-directed  FM  sweeps.  In  the  central  area  of  the 
maps,  multiunits  typically  responded  best  to  downward- 
directed  FM  sweeps.  In  most  cases  the  preference  in  the 
ventral  region  was  for  upward-directed  FM  sweeps, 
though  in  some  cases  this  region  contained  responses 
that  were  non-dircction-sclcctivc  or,  at  slow  FM  speeds, 
even  showed  some  preference  for  downward-directed 
sweeps.  For  the  most  part,  dircction-scicctivc  responses 
appeared  to  be  independent  of  CF.  Variability  between 
maps  in  the  spatial  distribution  of  preferred  direction  was 
most  often  observed  in  the  ventral  region  of  the  cortex. 
Units  located  in  the  dorsal  and  middle  regions  of  AI 
were,  in  general,  more  strongly  direction- selective  than 
units  located  in  the  ventral  region.  Thus,  the  degree  of 
direction  selectivity  may  be  related  to  the  variability  ob¬ 
served.  In  addition,  diiection  selectivity  may  also  be  re¬ 
lated  to  the  fact  that  significant  correlation  with  integrat¬ 
ed  excitatory  bandwidfh  was  observed  in  the  dorsal  but 
not  in  the  ventral  region.  Finally,  as  mentioned  above  for 
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preferred  speed,  it  is  possible  that  some  of  the  variability 
observed  for  locations  sampled  in  the  ventral  region  arc 
due  to  the  inclusion  of  units  belonging  to  All  rather  than 
AI. 

Several  possible  mechanisms  underlying  direction  se¬ 
lectivity  have  been  proposed  by  other  investigators  (e.g., 
Erulkar  et  al.  1968;  Suga  1965a, b,  1968,  1969;  Phillips  et 
al.  1985;  Shore  and  Nuttall  1985;  Shore  et  al.  1987; 
Mendclson  and  Grasse  1992).  With  the  exception  of 
Mendelson  and  Grasse  (1992),  the  suggested  mechanisms 
are  primarily  monaural  in  nature.  One  prevalent  type  of 
monaural  mechanism  is  characterized  by  inhibitory  side¬ 
bands  flanking  one  or  both  sides  of  the  excitatory  re¬ 
sponse  area.  Suga  (1965a,b,  1968,  1969)  and  Watanabe 
(1972)  have  accounted  for  FM  sweep  direction  selectivity 
in  terms  of  the  temporal  sequence  of  the  component  fre¬ 
quencies  in  the  sweep  that  successively  activate  inhibito¬ 
ry  and  excitatory  response  areas  of  neuronal  response 
piofiles.  It  has  been  suggested  that  asymmetric  inhibitory 
sidebands  can  generate  direction-selective  responses  to 
FM  sweeps  (Suga  1965a,b,  1968,  1969).  Recently,  Sutter 
and  Schreiner  (1991)  have  presented  evidence  that  the 
charaneristics  of  the  upper  and  lower  inhibitory  sidc- 
bar.ds  of  cortical  response  profiles  chinge  systematically 
across  the  dorsoventral  extent  of  cat  Al. 

The  spatial  distiibution  of  direction-selective  respons¬ 
es  observed  in  the  present  study  may  reflect  underlying 
inhibitory  organization  similar  to  that  observed  by  Sut¬ 
ter  and  Schreiner  (1991).  However,  inhibitory  sidebands 
may  not  completely  account  for  direction-selective  re¬ 
sponses  of  cortical  neurons.  Vartanian  (1974)  has  pointed 
out  that  cells  with  the  same  prei'erence  for  direction  of 
FM  sweep  can  exhibit  a  wide  variety  of  different  tuning 
ci  rves  and  different  inhibitoiy  areas.  In  addition, 
Mcndeison  and  Grasse  (1992)  have  recently  provided  ev¬ 
idence  demon:*rating  that  cortical  direction  selectivity 
need  not  arise  exclusively  from  a  monaural  input  but  can 
also  be  elicited  using  binaural  stimulation.  This  was  most 
apparent  in  ncuif'ws  that  appeared  non-direction-selec- 
tive  under  monaural  FM  sweep  conditions,  yet  were  di¬ 
rection-selective  when  stimulated  with  binaural  FM 
sweeps.  For  many  of  these  binaural  direction -selective 
responses,  the  basic  underlying  mechanism  seemed  to  op¬ 
erate  in  a  predominantly  inhibitory  fashion,  while  for 
olhr,  direction-selective  neurons  the  binaural  mccha 
nisni  was  excitafory. 

Preferred  speed  vt-rsus  preferred  direciion  of  FM  si'i  cp 

In  addiiioi  (o  clculating  direction  selectivity  for  the 
strongest  rcspc'isc  (independent  of  FM  speed),  direciion 
selectivity  was  aiso  determined  independently  for  each  of 
the  tlwec  FM  sweep  s[>eeds  used  in  the  present  study  and 
compared  with  that  derived  from  the  strongest  response. 
In  mos:  cases,  direction  selectivity  detennined  for  fast 
and  medium  FM  sweciis  closely  resembled  direction  se¬ 
lectivity  determined  for  the  strongest  response,  especially 
in  :hc  dorsal  and  niiddk.  vegions.  The  greatest  anitruni  of 
vanabilitv  of  direction  selectivity  occurred  with  slow  FM 
sweeps  and  was  most  apparent  in  the  central  and  ventral 


regions  of  the  cortex.  It  is  possible  that  much  of  this 
variability  can  be  attributed  to  responses  that  were  weak¬ 
er  than  those  obtained  with  medium  and  fast  FM  sweeps. 
The  weaker  respo  isivity  to  slow  FM  sweeps  may  be 
more  susceptible  than  medium  or  fast  FM  sweep  re¬ 
sponses  to  random  fluctuations.  A  reduced  responsive¬ 
ness  for  broadband  stimuli  has  been  observed  in  the  cen¬ 
tral  region  of  the  dorsoventral  extent  of  AI,  coinciding 
with  the  areas  of  slow  preferred  speeds  and  downward- 
directed  sweeps  (Schreiner  and  Mcndeison  1 990).  Howev¬ 
er,  this  coincidence  alone  cannot  explain  the  relative  loss 
of  direction  selectivity  for  slower  speeds  at  the  center  of 
the  maps,  because  those  locations  showed  good  direction 
selectivity  for  the  faster  FM  speeds. 

The  smallest  amount  of  variability  in  direction  selec¬ 
tivity  for  the  different  speed  conditions  occurred  in  the 
dorsal  region  of  AI.  This  suggests  that  direction  selectiv¬ 
ity  of  cells  in  dorsal  Al  is  largely  independent  of  FM 
speed.  In  contrast,  for  the  central  and  particularly  for  the 
ventral  aspect  of  AI,  direction  selectivity  may  be  more 
dependent  upon  the  speed  of  the  FM  sweep.  However, 
the  correlations  of  FM  speed  and  direction  for  cells  locat¬ 
ed  in  the  dorsal  region  and  for  those  located  in  the  ven¬ 
tral  region,  were  not  statistically  significant.  Thus,  while 
this  may  seem  to  contradict  some  of  the  results  described 
(i.e.,  units  in  the  mid-region  of  isofrcqucncy  contours  ap¬ 
pearing  to  prefer  slow  FM  speeds  and  downward-direct¬ 
ed  sweeps  while  units  located  more  dorsally  prefer  fast/ 
medium  FM  sweeps  and  upward-directed  FM  sweeps), 
the  difTcrcnces  observed  between  dorsal  and  ventral  re¬ 
gions  may  provide  a  clue  as  to  the  function  of  the  multi¬ 
ple  representations  of  upward-directed  FM  sweep  re¬ 
sponses  and  fast-  and  medium-selective  speed  responses 
distributed  within  AI.  Further,  while  these  relationships 
were  not  consistent  across  all  subregions  of  AI,  nor  with¬ 
in  nor  between  cases,  the  implication  of  these  results  is 
that  FM  speed  may  be  processed  independently  of  FM 
sweep  direction  in  some  but  not  all  cells  in  Al. 


Comparison  with  other  FM  studies 

At  the  level  of  the  auditory  nerve,  most  neurons  arc  not 
sensitive  to  FM  sweep  direction  except  at  very  fast  speeds 
(Sinex  and  Geisler  1981).  In  the  cochlear  nucleus  some 
researchers  have  found  that  cells  prefer  upward-directed 
FM  sweeps  at  high  rates  of  modulation  (Britt  and  St.irr 
1976;  Shore  et  al.  1987),  whil“  others  have  observed  a 
preference  for  downward-directed  FM  sweeps,  particu¬ 
larly  when  stimulus  intensity  is  increased  (Moller  1974). 
It  has  been  suggested  .hat  in  the  inferior  colliculus  and 
medial  geniculate  body  some  neurons  are  selective  for  the 
dri'ection  and/or  speed  of  FM  sweeps  (Watanabe  1972; 
Whitfield  and  Purser  1972;  Clopton  and  Winfield  1974; 
Vartanian  1974;  Poon  ct  al.  1991).  At  the  level  of  the 
cortex,  where  even  greater  integration  of  information 
from  the  periphery  occurs,  the  majority  of  neurons  have 
been  shown  to  respond  to  the  direction  and  speed  of  the 
FM  sweeps  as  well  as  to  the  extent  and  form  of  the  FM 
sweeps  (Suga  1965a,b;  Whitfield  and  Evans  1965;  Nomo- 
to  1980;  Scheich  and  Bonke  1981;  Mcndeison  and 
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Cyjiader  1985;  Phillips  et  al.  1985;  Mendclson  and 
Grasse  1992). 

Suga  and  his  colleagues  (Suga  1965a  1986;  0‘Ncill 
and  Suga  1982;  Tsuzuki  and  Suga  1988;  Edamatsu  et  al. 
1989)  were  the  first  to  demonstrate  that  responses  to  FM 
sweep  stimuli  are  systematically  distributed  in  the  cortex 
of  the  bat.  Respon.ses  to  FM  sweep  parameters,  such  as 
types  of  FM  sweeps  (eg.,  FMj-FMj,  FM,-FM,)  and  delay 
between  the  FM  sweeps  in  a  paired  presentation  arc  spa¬ 
tially  distributed  along  two  dimensions  in  a  region  of 
cortex  devoted  exclusively  to  processing  FM  stimuli. 
This  is  in  contrast  to  the  present  study  in  which  FM 
speed-  and  direction-selective  responses  appear  to  be 
spatially  distributed  along  one  cortical  dimension.  How¬ 
ever,  this  apparent  discrepancy  may  be  readily  explained 
by  the  fact  that  bats  have  a  highly  specialized  auditory 
sy.stem  equipped  to  handle  their  biosonar  communica¬ 
tion  signals. 

The  present  results  arc  most  directly  comparable  with 
the  single-unit  work  of  Mendclson  and  Cynader  (1985) 
and  Mendclson  and  Grasse  (1992),  as  these  studies  em¬ 
ployed  similar  FM  sweeps  to  those  used  in  the  present 
study.  In  agreement  with  the  present  results,  Mendclson 
and  colleagues  have  found  that  most  cortical  neurons 
respond  best  to  fast  or  medium  FM  sweeps.  These  au¬ 
thors  have  also  reported  that  59%  of  their  cells  (tested 
monaurally)  in  Al  were  direction  selective.  Of  these  direc¬ 
tion-selective  cells,  the  majority  (74%)  preferred  down- 
ward-directed  FM  sweeps.  In  the  present  study,  only 
45%  of  the  cortical  locations  sampled  were  direction  sc- 
lecuve.  However,  of  these,  almost  two  thirds  of  the  direc¬ 
tion-selective  responses  in  the  present  study  preferred 
downwaid-diiected  FM  sweeps. 

Se"'eral  possible  explanations  mav  account  for  the  dif¬ 
ference  ;n  the  percentage  of  direction-selective  cells  in  the 
two  types  of  studies.  The  first  is  that  different  types  of  FM 
sweeps  were  used:  Mendclson  and  colleagues  used  linear 
FM  sweeps  (0.0-50.0  kHz),  while  the  present  mapping 
study  used  exponential  FM  sweeps  (0.25-64.0  kHz).  In 
pilot  experiments  leading  up  to  the  present  mapping 
study,  a  comparison  was  made  between  responses  to  lin¬ 
ear  and  exponential  FM  sweeps  presented  at  two  differ¬ 
ent  FM  speeds.  It  was  found  that  some  cells  responded  in 
a  .similar  way  to  both  linear  and  exponential  FM  sweeps, 
while  other  unit  responses  changed  as  a  function  of  tl 
type  of  FM  sweep  presented.  Thus,  the  type  of  FM  sweep 
used  may  at  least  partially  account  for  the  difference  be¬ 
tween  the  single-unit  and  mapping  studies. 

A  second  explanation  for  the  difference  in  the  inci¬ 
dence  of  direction-selective  responses  may  be  related  to 
the  fact  that  the  single-unit  studies  presented  FM  sweeps 
at  approximately  20  dB  above  threshold,  while  the 
present  mapping  study  presented  FM  sweeps  at  40  dB 
above  threshold.  As  mentioned  above,  one  type  of  neural 
mechanism  thought  to  account  for  direction  selectivity  is 
based  on  the  presence  of  asymmetric  inhibitory  side¬ 
bands  (S’lga  1965a,b,  1968).  In  many  cases  these  inhibito¬ 
ry  :  '  :bands  only  become  apparent  at  higher  intensity 
levc. ..  Thus,  it  is  possible  that,  in  the  present  mapping 
study,  more  cells  were  encountered  that  possessed  both 
lower  and  upper  inhibitory  sidebands,  thcrcbv  rendering 


the  overall  response  of  the  unit  non-direction-sclcctive. 
Further  support  for  the  role  of  stimulus  intensity  can  be 
derived  from  the  work  of  Schreiner  and  Mendclson 
(1990).  They  found  that  while  the  cortical  spatial  distribu¬ 
tion  of  QiodB  and  Q40dB  appeared  similar,  these  two  re¬ 
sponse  properties  were  only  weakly  correlated,  suggest¬ 
ing  that  at  higher  stimulus  levels  additional  mechanisms 
may  be  involved  which  do  not  operate  at  lower  levels. 
Thus,  the  direction  selectivity  observed  at  lower  stimulus 
levels  may  not  be  manifest  at  higher  stimulus  levels. 

Finally,  the  difference  in  the  percentage  of  direction- 
selective  cells  between  the  two  types  of  studies  may  also 
be  due  to  single-  versus  multiunit  recording.  As  men¬ 
tioned  earlier,  a  multiunit  recording  samples  several  neu¬ 
rons  within  a  small  area.  It  is  possible  that  within  a  mul¬ 
tiunit  cluster  there  are  some  direction-selective  neurons 
and  some  uon-direction-selective  neurons.  Depending 
upon  the  number  of  direction  and  non-direction-sclcctive 
neurons  within  a  given  cluster,  it  is  possible  that  the  non¬ 
direction-selective  cells  may  dominate  and  diminish  the 
effects  of  the  direction-selective  neurons,  and  vice  versa, 
it  is  also  conceivable  that  some  units  within  a  cluster 
prefer  upward -directed  FM  sweeps  while  others  prefer 
downward-directed  FM  sweeps,  thus  causing  the  net  re¬ 
sponse  to  be  non-direction  selective.  In  this  sense,  the 
dircction-sclcctivc  responses  at  a  given  cortical  location 
reflect  the  dominant  neural  activity  characteristic  of  the 
cortical  processing  for  that  region. 


Distributions  of  other  stimulus  parameters  across  the 
isofrequcncy  domain  of  Al  have  been  discussed  in  detail 
previously  (Schreiner  and  Mendclson  1990;  Sutter  and 
Schreiner  1991;  Schreiner  et  al.  1992)  and  thus  will  only 
be  treated  summarily  here.  It  is  well  known  that  the  cor¬ 
tex  is  tonotopically  organized,  with  cells  that  possess 
high  CFs  encountered  in  the  anterior  region  and  lower 
CFs  in  the  posterior  region  of  Al.  Dorsovcntrally,  neu¬ 
rons  with  similar  CFs  arc  encountered  along  isofrequen- 
cy  corloiirs  (Rose  and  Woolscy  1948;  Merzcnich  et  al. 
1975).  One  response  feature  that  appears  to  be  distribut¬ 
ed  orthogonally  to  CF  is  the  type  of  binaural  response. 
Imig  and  Adrian  (1977)  and,  later,  Middlcbrooks  et  al. 
(1980)  found  alternating  bands  or  elongated  patches  of 
EE  and  El  cells  oriented  approximately  orthogonal  to 
isofrequcncy  contours.  These  results  have  recently  been 
extended  by  Rcalc  and  Kcttner  (1986)  who  compared 
intcraural  intensity  differences  with  equal-intensity  stim¬ 
uli.  They  found  that  the  distribution  of  EE  and  El  cells  in 
the  dorsal  and  middle  regions  of  Al  was  similar  to  that 
observed  in  other  studies  where  equal-intensity  stimuli 
had  been  used.  However,  in  ventral  Al,  Rcalc  and  Ket- 
tner  (1986)  described  a  disproportionate  number  of  cells 
with  both  facilitatory  and  inhibitory  response  properties. 
This  segregation  in  binaural  responses  between  the  ven¬ 
tral  and  more  dorsal  regions  of  Al  provides  additional 
support  for  the  present  findings  that  ventral  Al,  exclud¬ 
ing  those  sites  that  may  belong  to  All,  may  be  functional¬ 
ly  distinct  from  the  more  dorsal  regions  cf  Al. 


Comparison  with  other  auditory  mapping  studies 


Schreiner  and  Mendelson  (199C)  examined  the  spatial 
distribution  of  response  properties  to  pure  tone  and 
broadband  transient  stimuli  across  the  entire  dorsoven- 
tral  extent  of  AI.  The  most  significant  finding  was  the 
presence  of  a  systematic  distribution  of  the  integrated 
excitatory  bandwidth  along  the  isofrequency  domain. 
For  QiodB  Q40dB>  the  highest  values  were  found  in  the 
middle  of  AI,  while  locations  with  lower  Q  values  were 
located  in  more  dorsal  nnd  ventral  subregions,  support¬ 
ing  earlier  observations  by  Schreiner  and  Cynader  (1984). 
Responses  to  broadband  transients  were  also  systemati¬ 
cally  distributed  across  the  conex  such  that  those  units 
most  responsive  to  clicks  were  located  in  the  more  dorsal 
and  ventral  subregions. 

In  the  present  study  it  was  found  that  the  distribution 
of  preferred  speed  and  direction  responses  for  FM  sweeps 
were  correlated  with  integrated  excitatory  bandwidth 
and  broadband  transient  responses.  A  number  of  func¬ 
tional  relationships  were  revealed  by  this .  nalysis.  While 
some  of  these  statistically  significant  correlations  were 
relatively  weak,  they  still  suggest  that  functional  interac¬ 
tions  may  arise  between  these  response  parameters. 
These  relatively  low  correlation  values  also  suggest  the 
presence  of  other  factors,  such  as  inhibitory  sidebands, 
that  arc  most  likely  involved  in  the  neural  mechanisms 
underlying  speed  and  direction  selectivity  to  FM  sweeps. 
As  described  above,  in  the  dorsal  and  ventral  aspects  of 
AI,  multiunits  preferred  fast  or  medium  upward-directed 
FM  sweeps.  Coincident  with  this  FM  sweep  selectivity  is 
that  these  cortical  responses  also  tended  to  have  low 
QiodB  and  Q^odB  values  and  responded  best  to  broadband 
stimuli  (Schreiner  and  Mendelson  1990).  This  stands  in 
marked  contrast  to  responses  recorded  in  the  middle  re¬ 
gion  of  the  isofrequency  domain  that  preferred  slower 
(i.e.,  incd’um  and/or  slow  FM  sweeps)  and  downward-di¬ 
rected  FM  sweeps  characterized  by  narrow  tuning  curves 
and  poor  responses  to  transients  (Schreiner  and  Mendel¬ 
son  1990). 

Excitatory  bandwidth  and  speed  selectivity  of  a  corti¬ 
cal  location  can  be  linked  by  the  concept  of  integration 
time.  A  strong  response  is  only  elicited  if  the  stimulus 
remains  present  within  the  cell’s  excitatory  response  area 
for  some  minimal  period  of  time.  Thus,  slow  FM  speeds 
arc  required  to  excite  neurons  with  narrow  tuning  curves, 
whereas  faster  speeds  are  sufficient  for  more  broadly 
tuned  neurons.  The  apparent  link  between  excitatory 
bandwidth  and  direction  selectivity  (e.g.,  sites  with  nar¬ 
row  tuning  curves  responding  preferentially  to  down- 
ward-d':ected  sweeps)  is  less  straight  forward.  In  addi¬ 
tion,  it  may  also  involve  inhibitory  sidebands  that  are 
probably  used  to  sharpen  (i.e.,  tighten)  the  bandwidth  of 
neural  tuning  curves. 

The  .significant  correlation  between  responses  to 
broadband  transient  stimuli  and  preferred  FM  speed  is 
not  unexpected  given  that  a  fast  FM  sweep  cfa, 
essence,  be  considered  analogou:  a  trie!  broadband 
signal  not  unlike  a  click.  TJkjs,  thoic  urdts  dial  rc-spondcd 
bc.st  to  medium  or  fast  FM  sweep also  responded  well 
to  transient  signals,  i'he  fact  that  there  '.vas  no  siginfica,’) 
correlat’on  between  a  icsponsc  to  a  broadband  transient 
signal  and  a  direction-sclecti  'c  respens-^  tor  FM  sweeps 


suggests  that  there  arc  different  mechanisms  for  direction 
selectivity  and  preferred  speed  of  FM  sweeps.  Additional 
support  for  this  suggestion  is  derived  from  the  lack  of 
significant  correlations  between  preferred  speed  and  di- 
I'cction  of  FM  sweep  across  the  cortex. 


Psychophysical  studies 

A  number  of  investigators  have  provided  evidence  sug¬ 
gesting  that  FM  and  AM  stimuli  are  processed  indepen¬ 
dently  by  parallel  pathways  or  “channels”  in  the  audito¬ 
ry  system  (Kay  and  Matthews  1972;  Gardner  and  Wilson 
1979;  Regan  and  Tanslcy  1979;  Tanslcy  and  Regan  1979). 
A  recent  neuromagnetic  study  by  Makcla  et  al.  (1987)  has 
confirmed  these  findings  in  the  human  auditory  cortex.  In 
addition  to  the  independent  processing  of  AM  and  FM 
stimuli,  several  of  these  researchers  have  shown  that  up¬ 
ward-directed  linear  FM  sweeps  appear  to  be  processed 
independently  from  downward-directed  FM  sweeps. 
Further,  it  has  been  suggested  that  channels  exist  in  the 
auditory  pathway  that  are  sen.citivc  to  different  rates  of 
frequency  modulation  (Kay  1982).  The  global  cortical 
mechanism  described  here  is  composed  of  orderly  ar¬ 
rangements  of  responses  to  the  direction  and  speed  of 
FM  sweeps  and  may  help  in  understanding  some  of  these 
psychophysical  findings  by  maintaining  close  physiologi¬ 
cal  parallels  with  these  psychophysical  direction-  and 
rate-independent  FM  channels. 


Conclusions 

The  present  study  shows  that  multiunit  responses  to  the 
direction  and  speed  of  FM  sweeps  are  spatially  distribut¬ 
ed  in  a  nonrandom  fashion  along  the  dorsoventral  axis  of 
AI  of  the  cat.  In  general,  the  pattern  of  the  spatial  distri¬ 
bution  of  responses  to  the  speed  of  FM  sweeps  can  be 
summarized  as:  fast-mcdium-slow-mediura-fast,  pro¬ 
ceeding  in  a  dorsal  to  ventral  direction  across  AI.  Fur¬ 
thermore,  multiunit  responses  preferentially  selective  for 
upward-directed  FM  sweeps  were  located  in  the  dorsal 
and  ventral  aspects  of  AI  while  multiunit  responses  pre¬ 
ferring  downward-directed  FM  sweeps  were  located  in 
the  mid-region  of  the  isofrequency  domain.  In  addition  to 
the  already-established  tonotopicity  and  biuaural  re¬ 
sponse  properties,  these  results  provide  further  evidence 
for  a  systematic  distribution  of  FM  responses  which  de¬ 
fines  a  dimension  of  functional  organization  oriented 
along  the  isofrequency  domain  of  AI.  Finally,  thcfic  re¬ 
sults  may  provide  evidence  for  a  physiological  for 
independent  psychophysical  channels  underlyi.ug  the 
perception  of  F'M  sweeps. 
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Abstract 


The  neuronal  responses  to  tones,  broadband  transient  stimuli,  and  frequency-modulated  (FM) 
sweeps  were  mapped  in  the  primary  auditory  cortex  (AI)  of  barbiturate-anaesthetized  cats.  The  spatial 
distribution  of  the  final  two  response  parameters  in  this  series  of  experiments,  onset  latency  and  binaural 

response,  were  detennined  in  four  cases.  The  functional  relationship  between  these  two  parameters  and  the 
previously  reported  spectral  (QiodB>  Q40dB*  broadband  transient  stimulus),  intensity  (threshold, 

strongest  response  level  (SRL),  dynamic  range  and  monotonicity)  and  temporal  (FM  sweep  direction  and 
speed)  parameters  was  subjected  to  both  a  global  and  regional  analysis.  Onset  latency  responses  were 
systematically  distributed  along  tlie  dorsoventral/isofrcquency  axis  of  AI  such  that  units  with  shorter 
latencies  were  located  in  the  central  region  while  units  with  longer  latencies  were  more  often  found  in  the 
dorsal  and  ventral  portions  of  the  conex.  As  shown  by  other  investigators,  alternating  bands  or  patches  of 
EE  and  El  units  were  also  distributed  aa'oss  the  cortex.  Ihc  results  of  a  point-by-point  analysis  revealed  a 
lack  of  dependency  of  onset  latency  on  binaural  response  type.  However,  when  the  relationsliip  between 
onset  latency  response  and  the  remaining  10  response  parameters  was  examined,  the  correlations  with  the 
following  parameters  were  found  to  be  statistically  significant:  QiOdB’  *^-40dB'  broadband  transient 

response,  sffongest  response  level  (SRL),  monotonicity,  and  prefen-ed  FM  sweep  direction.  These 
correlations  suggest  that  units  that  respond  with  shorter  onset  latencies  are  more  sh.3rply  tuned,  do  not 
re.spond  well  to  broadbatid  stimuli,  have  a  higher  degree  of  nonmonotonicity,  and  prefer  FM  sweeps  that 
change  from  a  high  to  a  low  frequency.  Binaural  response  type  was  found  to  be  significantly  correlated 
with  Q4()(iB,  broadband  uansient  response,  threshold,  SRL,  dynamic  range,  and  preferred  FM  sweep 

speed.  These  coirelaiions  imply  that  El  neurons  tend  to  have  shaq^cr  tuning,  weaker  responses  to 
broadband  stimuli,  lower  ihresholds,  lower  SRLs,  a  nanower  dynamic  range,  higher  degree  of 
nonmonotonicity,  and  prefe'"  slower  FM  sweeps  than  EE  units.  The  systematic  distribution  and  the 
functional  relationship  between  tiiese  response  parzmeters  may  provide  the  representational  basis  for  the 
detection  and  identification  of  specific  features  in  the  animal’s  natural  envii'onment. 

Keywords:  Piimary  auditory  conex  -  Onset  latency  -  Binaural  response  type  -  Topographical  functional 
organization  -  Cat 
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Introduction 


Pievious  investigations  of  the  functional  organization  of  the  primary  auditory  cortex  (AI)  of  the  cat 
have  revealed  that  a  number  of  response  properties  appear  to  be  systematically  distributed  within  AI.  One 
of  the  most  prominent  and  best  documented  of  these  response  properties  is  the  representation  of  the 
characteristic  frequency  (CF)  of  neurons  (e.g.,  Woolsey  and  Walzl,  1942;  Merzenich  et  al.,  1975;  Reale 
and  Imig,  1980).  This  property  is  revealed  as  an  increase  in  the  CF  of  neurons  along  the  rostrocaudal 
extent  of  AI.  A  second  prominent  organizational  feature  of  AI  is  the  spatial  segregation  of  binaural 
facilitatory  (EE)  and  binaural  inhibitory  (El)  cells  along  the  dorsoventral  extent  of  AI  (Imig  and  Adrian, 
1977;  Middlebrooks  et  al.,  1980).  Reale  and  Kettner  (1986)  have  extended  these  findings  by  showing  tliat 
cells  located  in  the  ventral  region  of  cat  AI  contain  both  binaural  facilitatory  and  inhibitory  response 
characteristics,  depending  on  the  interaural  intensity  differences  of  the  stimuli  presented.  Recently,  a 
number  of  investigators  have  observed  that  other  response  properties,  such  as  spectral  characteristics, 
inhibitory  sidebands,  frequency  modulated  (FM)  sweep  selectivity,  as  well  as  a  subset  of  sound 
localization  cues,  appear  to  be  distributed  in  a  non-random  fasliion  within  AI  (Middlebrooks  and 
Pettigrew,  1981;  Jenkins  and  Merzenich.  1984;  Realc  and  Kettner,  1986;  Irnig.et  al,  1990;  Rajan  ct  al, 
1990;  Schreiner  and  Mendelson,  1990;  Sutter  and  Schreiner,  1991;  Heil  et  al.,  1992;  Mendelson  et  al., 
1993;  Shamma  et  al.,  1993). 

hi  a  recent  series  of  studies,  we  examined  the  spatial  distribution  of  twelve  properties  of  single- 
and  multiple-unit  response  profiles  in  AI.  One  of  the  goals  of  these  experiments  was  to  examine  a 
relauvely  large  number  of  response  properties  at  individual  cortical  locations  in  order  to  compare  different 
rcsTionse  pai  arneter  distributions  directly  and  to  ascertain  their  potential  interrelationships.  In  the  first 


report  in  this  seiies  of  expenuients  (Sciireinei'  aitd  Mendelson,  1990),  we  demonsuaied  that  the  “integrated 


excitatoiy  bandwidth"  (i.e.,  tlie  locally  effective  frequency  range  that  produces  excitation),  as  well  as 
responses  to  broadband  tran.sicnts,  were  topographically  arranged  within  AI.  The  principal  gradient  of  the 
integrated  excitatory  bandvvidtli  was  found  to  be  oriented  oithogonally  to  tlie  cochleotopic  representation 
such  that  narrowly  tuned  coiiical  neui'ons  were  typically  locatwi  in  the  dcrso-ventral  center  of  AI  while 
more  broadly  tuned  units  were  found  in  more  dorsal  and  ventral  subregions  of  the  cortex.  In  addition,  we 


I 
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found  that  cortical  locations  in  which  cells  displayed  strong  responses  to  broadband  transient  stimuli 
generally  favoured  the  extreme  dorsal  and  ventral  subregions  of  AI  and  not  the  sharply  tuned  central 
subregion.  Additional  response  parameters  such  as  multi-peaked  tuning  curves,  inhibitory  sidebands 
(Sutter  and  Schreiner,  1991),  stimulus  intensity  responses  (eg.,  threshold,  strongest  response  level, 
dynamic  rarige)  and  monotoniciry  (Schreiner  et  al.  1992)  have  also  been  found  in  these  studies  to  be  non- 
uniformly  distributed  along  the  dorsoventral  extent  of  AI.  Finally,  cortical  responses  to  the  direction 
(changing  from  a  low  to  a  high  frequency  or  vice  versa)  and  speed  (rate  of  change  of  frequency  over  time) 
of  FM  sweeps  have  been  shown  to  exhibit  an  orgaiiized  spatial  distribution  along  this  isofrequency  axis  of 
AI  in  cat  (Mendeison  et  al ,  1993)  and  feiret  (Shamma  et  al.,  1993). 

The  present  report  describes  the  spatial  distribution  of  two  additional  response  properties  in  this 
series  of  experiments,  onset  latency  and  binaural  interaction  responses,  and  compares  their  characteristics 
to  those  previously  describea.  Onset  latency  is  a  fundamental  descriptor  of  neuronal  responses  and  has 
been  studied  in  single  units  in  the  cortex  (Phillips  and  Irvine,  1981;  Phillips  et  al.,  1985).  However,  the 
transcortical  spatial  distribution  of  these  responses  and  the  relationship  of  onset  timing  behavior  to  other 
response  parameters  has  not  been  thoroughly  investigated.  While  the  spatial  distribution  of  binaural 
responses  has  been  examined  extensively,  its  inclusion  in  the  present  paper  provides  a  more  complete  and 
detailed  investigation  of  the  functional  topography  of  twelve  different  response  parameters  in  AI  studied 
under  identical  conditions.  In  this  way,  an  in-depth  analysis  of  the  covariance  of  all  twelve  response 
parameters  can  be  carried  out  in  parallel.  A  subset  of  these  correlations  will  be  presented  in  the  present 
paper  while  a  global  comparison  between  all  parameters  investigated  thus  far  will  be  dealt  with  in  a 
subsequent  paper  in  this  series. 

Materials  and  Methods 
Preparation 

A  detailed  desaipiion  of  tlic  preparation  and  recording  methods  has  been  provided  in  a  previous 
paper  in  this  series  (see  Schreiner  and  Mendeison,  1990)  and  thus,  will  only  be  summarized  here. 
Extensive  spatial  mapping  of  multiple  unit  responses  was  conducted  on  four  adult  cats.  In  addition,  the 
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latency  distribution  of  single  unit  responses  along  the  isofrequency  domain  was  obtained  (sec  Schreiner 
and  Sutter,  1992;  Sutter  and  Schreiner  1994).  Initially,  animals  were  tranquilized  with  an  intramuscular 
injection  of  ketamine  hydrochloride  (10  mg/kg)  and  acetylprornazine  maleate  (0.1  mg/kg)  to  allow  for 
venous  cannulation.  Animals  were  then  given  sodium  pentobarbital  (30  mg/kg  i.v.)  followed  by 
intramuscular  injections  of  atropine  (1  mg  im)  to  reduce  salivation  and  dexarnethasorie  sodium  phosphate 
(0.14  mg/kg  im)  to  prevent  brain  edema.  A  surgical  level  of  anaesthesia  was  maintained  throughout  the 
experiment  with  a  constant  i.v.  infusion  of  sodium  pentobarbital  (2  mg/kg/hr)  in  lactated  Ringer's 
solution.  Body  temperature  was  maintained  at  37.5  C. 

Both  pinnae  were  surgically  reflected  and  the  external  meatuses  exposed  to  allow  for  insertion  of 
speaker  tubes  within  3  mm  of  the  tympanic  membranes.  The  temporal  muscle  was  reflected  on  the  right 
side  and  a  craniotomy  was  performed  over  the  middle  of  the  ectosylvian  gyrus.  The  dura  mater  was 
removed  and  the  brain  was  bathed  in  siheone  oil.  A  photograph  of  the  surface  vasculature  was  used  to 
record  the  electrode  penetration  sites. 

Recording  procedure 

The  animal  was  placed  in  a  modified  head-holder  that  allowed  unobstructetl  access  to  the  external 
mead.  Recording  took  place  in  an  elcctrically-shicidcd  sound-attenuating  chamber  (LAC).  Parylene- 
insulated  tungsten  electrodes  were  introduced  into  the  cortex  orthogonal  to  the  surface  (as  viewed  through 
a  Zeiss  operating  microscope)  by  means  of  a  remote-controlled  stepping  motor  microdrive  (Kopf).  Single 
unit  and  unit  cluster  responses  were  examined  at  one  location  per  electrode  penetration  at  a  depth  between 
600  and  1000  )a.m  below  the  cortical  surface,  corresponding  approximately  to  layers  HI  and  IV.  The 
number  of  units  per  cell  cluster  was  estimated  to  be  between  2  and  6.  Stimulus-evoked  action  potential 
event  times  were  collected  and  stored  on-line  by  a  PDF  1 1/73  computer. 

In  all  cases,  a  cortical  siuface  area  of  approximately  3x5  mm^  to  3  x  15  mm^  was  examined  witli 
80-150  electrode  penetrations  in  each  map.  The  recording  strategy  was  to  map  the  isofrequency  domain  of 
AI  from  its  most  dorsal,  to  its  most  ventral  aspects,  often  extending  into  the  second  auditory  field  (AH). 
The  range  of  i.sofrequency  contours  in',  tstigaied  across  all  animals  ranged  from  3.2  to  28.0  kHz. 


Stimulus  Generation 

Tone  burst  stimuli,  generated  by  a  microprocessor,  (TMS32010;  16  bit  D/A  converter  at  120  kKt; 
low-pass  filter  at  35  kHz)  were  50  ms  in  duration  with  a  3  ms  rise/fall  time.  The  interstimulus  interval  was 
500-1000  ms.  Tone  bursts  were  presented  in  a  pseudo-random  sequence  of  different  ffequency/level 
combinations  selected  from  15  level  and  45  frequency  values  (geometrically  centred  around  the  CF). 
Stimulus  levels  were  in  5  dB  increments,  resulting  in  a  dynamic  range  of  70  dB.  Responses  to  a  minimum 
of  675  different  tone  bursts  were  recorded  at  each  electrode  site.  In  addition  to  the  pure  tone  burst  stimuli, 
animals  were  also  presented  with  broadband  transient  si'mals  (sec  Schreiner  and  Mendclson,  1990)  and 
exponential  FM  sweeps  (see  Mendelson  et  al.  1993).  FM  sweeps  extended  from  0.25  to  64.0  kHz 
(upward- directed)  or  from  64.0  to  0.25  kHz  (downward-directed).  The  rates  of  frequency  change  (i.e., 
sweep  speeds)  used  were  i  10  octaves/sec,  33  octaves/sec,  and  10  octavcs/scc  witli  corresponding  sweep 
durations  of  72  ms,  240  ms,  and  8(X)  ms,  respectively. 

Ail  stimuli  were  attenuated  (Hewlett-Packard)  and  passed  through  transducers  (STAX  54)  that 
were  attached  to  sound  delivery  tubes  sealed  into  the  auditory  mead.  The  stimulus  delivery  system  was 
calibrated  for  pure  tones  with  a  sound  level  meter  (Bruel  &  Kjaer  2209).  The  transfer  function  of  the 
sound  delivery  system  showed  a  roll-off  of  10  dB/oct  above  14  kHz  and  additional  deviations  of  up  to  6 
dB  wei  '  possible  (see  Schreiner  and  Mendelson,  1990). 


Data  Analysis 

Frequency  response  areas  (FRAs)  for  monaural  sdmulatiou  (conb'alateral  ear)  were  objccdvely 
determined  from  the  675  frcquency/lcve!  combinadons  recorded  at  each  electrode  site.  From  this  the 
integrated  exciuitory  bandwidth  at  10  and  40  dB  (Q^q^b  and  Q40(1B'  respectively)  above  tiireshold  were 

derived.  Shonest  (i.e.,  minimum)  onset  latency  and  latency  at  30  to  40  dB  above  minimum  threshold  were 
ascertained  from  the  latency/level  function  determined  for  each  unit  or  unit  cluster.  The  arrival  time  of  the 
first  spike  of  the  response  that  was  consistent  with  those  obtained  for  neighboring  stimulus  intensities  was 
taken  as  the  onset  latency  measure.  Binaural  interaction  responses  were  assessed  by  presenting  CF  to  both 
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ears  at  equal  intensity,  usually  5  to  15  dB  above  minimum  threshold.  Multiple-unit  or  single  unit  responses 
were  then  categorized  as  binaural  facilitatory  (EE),  binaural  inhibitory  (El),  or  binaural  occluder  (EO) 
responses  (whereby  the  ipsilateral  ear  had  no  significant  effect  on  the  binaural  response  of  the  unit).  The 
strength  of  the  unit’s  response  to  broadband  transient  stimuli  was  also  determined  at  each  cortical  location 
(expressed  in  %  of  the  maximum  response  to  a  pure  tone  stimulus).  In  addition,  the  following  response 
propenies  were  assessed:  contralateral  tlireshold,  monotonicity,  dynamic  range,  and  strongest  response 
level  (SRL)  preferred  speed  and  preferred  direction  of  FM  sweep  (sec  Schreiner  et  al.,  1992;  Mendelson  et 
al.,  1993). 

Pseudo  three-dimensional  projections  and  contour  plots  were  utilized  to  represent  the  spatial 
distribution  of  response  parameters  across  the  cortical  surface  (Golden  Software,  Inc.).  The  cortical 
locations  of  the  recording  sites  were  used  to  generate  a  two-dimensional  grid  of  the  represented  area  by 
projecting  the  actual  sites  to  the  nearest  grid  point.  Elevation  ut  the  gridded  surface  corresponds  to  the  local 
spatially  averaged  magnitude  of  a  functional  parameter  (e.g.,  onset  latency)  at  a  given  site.  The  elevated 
surfaces  are  represented  eitlier  as  die  orthographic  projection  of  the  surface  grid  or  of  the  elevated  contours 
of  equal  parameter  values  (e.g.,  isolatency  or  isofrcquency  contours).  In  some  cases,  the  actual  onset 
latencies  are  indicated  in  a  box  below  the  pseudo  three-dimensional  projection  for  direct  compaiison  of  the 
value  and  location  of  each  measurement.  The  interpolation  algorithm  used  (inverse  distance  .squared) 
inherently  applied  some  spatial  smootiiing  to  the  data,  resulting  mainly  in  a  slight  compression  of  the 
encountered  functional  range  and  a  local  averaging  of  closely  spaced  adjacent  points  (see  Schreiner  and 
Mendelson,  1990).  This  method  emphasizes,  although  only  slightly,  topographical  organizations  with  low 
spatial  frequency  and  de-emphasizes  steep  local  gradients  or  variabilities.  It  should  be  kept  in  mind  that  not 
every'  grid  point  is  directly  supported  by  a  datum  point  (equivalently,  not  every  .section  in  a  linear  or 
polynomial  fit  of  a  two-dimensional  data  distribution  is  equally  suppoiied  by  data  points).  The 
'smoothing'  of  the  surface  plots  represents  only  a  small  distortion  of  the  data  tliat  is  net  unlike  a  two-point 
or  three-point  averaging  in  a  two-dimensional  data  representation.  In  this  case,  the  interpolation  can  be 
considered  to  occur  along  both  sets  of  gridlines. 
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Results 

A  total  of  463  single  and  multiple  unit  responses  for  which  a  CF  could  be  determined  were 
recorded  from  four  extensively  mapped  animals.  In  addition,  104  single  units  and  1 1 1  mulriple  units 
encountered  along  empirically  determined  isofrequency  axes  were  investigated  in  four  additional  animals. 
For  multiple-unit  mapping,  the  average  distance  between  each  recording  site  was  approximately  150-350 
[im.  Within  each  map,  a  large  portion  (>75%)  of  the  tonotopic  map  indicative  of  AI  was  observed 
(Merzenich  et  al.,  1975;  Reale  and  Imig,  1980).  For  three  of  the  four  cases,  the  spatial  distributions  of 
characteristic  frequency  (CF),  integrated  excitatory  bandwidth  (expressed  as  or  C^odB^’ 

broadband  transient  stimulus  (Schreiner  and  Mendelson,  1990),  FM  sweep  responses  (Mendelson  ct  al., 
1993),  and  tone  intensity  responses  (Schreiner  et  al.,  1992)  are  described  in  the  other  papers  in  this  scries 
and  thus  will  only  be  briefly  summarized  here.  We  begin  with  mapping  data  (N=  186)  for  one  case  which 
has  not  been  previously  described  and  for  which  the  spatial  distribution  cf  the  frequency  tuning  curve  (or 
CF),  QiodB’  minimum  tlireshold  (case  #86-173)  have  been  determined. 

The  CF  of  units  for  case  #86- 173  increased  monotonically  from  approximately  4  to  13  kliz  (Fig. 

1  A)  as  the  electrode  was  moved  in  a  caudal  to  rostral  direction  for  most  parts  of  the  cortical  area  mapped. 
In  the  ventral  portion  of  the  map,  a  sudden  increase  in  CFs  above  16  kHz  indicates  the  transition  to  the 
second  auditory  tieia  U;  Schreiner  and  Cynader,  1984).  The  integrated  excitatory  bandwidth  of  the 
multi-unit  responses  for  this  case  changed  systematically  along  the  dorsoventral  axis  of  area  AI  and  across 
its  ventral  border  with  area  AH  (Fig.  IB).  The  ventral  ponion  of  the  mapped  area  contained  units  that 
possessed  the  lowest  QjodB  corresponding  to  broad  integrated  excitatory  bandwidths.  Many  of 

these  ventral  multiple-unit  responses  were  most  likely  located  in  AH  (Schreiner  and  Cynader,  1984). 
QlOdB  values  increased  continuously  toward  the  center  of  Al,  with  the  n;uTowest  bandwidths  recorded 

approximately  2  mm  dorsal  to  the  AI/AII  border.  This  central  region  with  narrow  integrated  excitatory 
bandwidths  and  a  strict  cochleotopic  gradient  is  indicative  of  fhe  classically  defined  primary  auditory  area 
(Merzenich  et  al.,  1975;  Reale  and  Imig,  1980).  Moving  from  central  Al  toward  the  suprasylvian  sulcus, 
bandwidth  sharpness  gradually  decreased.  In  the  most  dorsal  portion  of  the  mapped  region,  the  principal 
cochleotopic  gradient  appeared  to  be  maintained  despite  a  slight  change  in  die  orientation  of  the 
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isofrequency  contours  (see  Fig.  1  A)  that  is  consistent  with  previous  reports  (Middlebrooks  and  Zook, 
1983;  Sutter  and  Schreiner,  1992). 

In  addition  to  the  two-fold  representation  of  the  integrated  excitatory  bandwidth  falling  along  the 
dorso-ventral  axis  of  AI,  a  second,  less  well-defined  gradient  of  tuning  bandwidth  was  evident  in  the 
central  portion  of  AI  mnning  along  the  postero-anu  ior  axis.  This  second  gradient  was  in  register  with  the 
frequency  gradient  typically  observed  in  single  unit  data  (Phillips  and  Irvine,  1980).  The  basic  features  of 
the  distribution  of  integrated  excitatory  bandwidth  across  the  middle  ectosylvian  gyrus  revealed  by  case 
#86-173  was,  in  general,  consistent  with  those  found  in  all  other  cases  studied  thus  far  (see  Schreiner  and 
Mendelson,  1990). 

The  spatial  distribution  of  minimum  response  thresholds  for  case  #86-173  is  shown  in  Figure  1C. 
A  low  threshold  region  was  evident  in  the  central  and  dorsal  subregions  with  higher  thresholds  in  the 
ventral  and  most  dorsal  portions  of  the  map.  An  increase  of  threshold  along  the  cochleotopic  gradient 
toward  higher  CFs  was  also  observed.  A  broad  region  of  lowest  thresholds  was  found  to  be  in  close 
alignment  with  the  region  of  overall  sharpest  tuning  (see  Schreiner  et  al.,  1992;  Heil  et  al.,  1992). 


Representation  of  Onset  Latency 

On  average,  onset  latency  responses  to  CF  tones  presented  30  -40  dB  above  tlireshold  varied 
between  8  and  18  ms  for  all  cases  studied.  Fig.  2  illustrates  representative  latency/intensity  level  profiles 
for  four  multiple-units  from  case  #87-518.  Latency  was  measured  in  5  dB  steps  over  a  range  of  70  dB.  As 
can  be  seen  in  all  four  panels,  onset  latencies  were  longer  at  lower  intensity  levels.  For  the  response  profile 
illustrated  in  Fig.  2a,  onset  latency  decreased  non-linearly  as  stimulus  intensity  increased.  In  Fig.  2b,  there 
was  a  precipitous  decrease  in  the  onset  latency  between  10  -  25  dB  at  winch  point  latency  reached 
asymptote.  The  onset  latency  profile  shown  in  Fig.  2c  gradually  decreased  as  stimulus  intensity  increased 
until  approximately  60  dB  above  threshold  where  the  onset  latency  began  to  increase.  Fig.  2d  shows  a  unit 
for  which  the  onset  latency  decreased  gradually  and  did  not  asymptote  until  40  dB  above  threshold.  All 
other  recorded  latency/level  profiles  resembled  one  of  these  examples. 
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Fig.  3  iliustrates  the  relationship  between  onset  latency  atid  CF  for  each  of  the  four 
intensively  studied  cases.  As  can  be  seen,  the  large  variance  in  the  onset  latency  data  obscures  any 
potential  dependency  upon  CF.  In  other  words,  a  wide  range  of  onset  latencies  were  encountered  for  each 
narrov/  range  of  CFs.  The  overall  increase  of  latency  for  CFs  below  15kHz  in  case  #87-001  (Fig.  2B)  is 
not  necessarily  a  reflection  of  the  latency/CF  relationship  suggested  from  studies  of  the  peripheral  auditory 
system  (eg.  Mpller,  1977).  If  one  postulates  a  non-uniform  spatial  distribution  of  latency  values  along  the 
dorsal-ventral  extent  of  AI,  this  increase  in  latency  might  just  be  a  reflection  of  an  incomplete  spatial 
sampling  of  the  isofrequency  domain.  Interestingly,  the  orientation  of  the  mapped  cortical  area  of  cat  #87- 
001  was  turned  24*^  with  regard  to  the  orientation  of  the  i.,ofrequcncy  domain  (see  Fig.  10;  sec  also  Fig.  4 
in  Schreiner  and  Mendelson,  1990).  Therefore  it  is  conceivable  that  a  short-latency  region  of  the  lowest 
CFs  for  this  case  was  missed  during  the  mapping. 

Overall,  the  global  spatial  distribution  of  onset  latency  values  obtained  at  30-  40  dB  above 
threshold  exihibited  a  non-random  organization.  There  was  a  tendency  for  multiple-units  located  the 
more  dorsal  and  ventral  subregions  of  AI,  as  well  as  those  located  in  dorsal  AH,  to  respond  with  longer 
onset  latencies  than  those  encountered  in  central  AI.  In  some  cases,  variability  in  onset  latency  was 
obsenred  in  the  dorsal  portion  of  AI.  Fig.  4  illustrates  an  example  (#87-(X)l)  of  the  spatially  reconstructed 
distribution  of  onset  latencies  observed  over  a  high-frequency  zone  of  AI.  As  mentioned  above  for  this 
case,  units  with  CF's  between  10.3  and  26.3  kHz  were  examined.  Below  the  pseudo- three-dimensional 
plot  is  an  insert  showing  the  latency  values  (in  msec)  to  CF  observed  at  each  cortical  location.  In  the 
pseudo-three-dimensional  projections,  the  arrowheads  al.ong  the  X-axis  indicate  the  approximate 
orientation  of  the  isofrequency  contours.  For  this  case  the  orientation  of  the  mapped  area  was  tilted  24° 
relative  to  the  isofrequency  orientation,  resulting  in  an  approximate  alignment  of  the  18.5  kHz  contour 
with  the  ventro-posterior  to  dorso-anterior  axis  of  the  map.  The  Y-rntis  represents  the  dorsoventral  extent 
over  5  mm.  The  elevation  of  the  surface  of  the  plot  (i.e.,  along  the  Z-axis)  indicates  the  onset  latencies  at 
the  recording  locations.  Low  elevations  indicate  shorter  onset  latencies,  while  lugher  elevations  indicate 
longer  onset  latencies  In  general,  longer  onset  latencies  were  o'  ■‘’^■rd  in  the  dorsal  subregion  of  AI  and 
to  a  lesser  extent,  in  the  ventral  subregion.  This  is  apparent  when  considering  the  isofrequency  contours 


between  18.5  £ind  22.0  kHz.  At  lower  frequencies,  longer  onset  latencies  were  observed,  especially  in  the 
dorsal  and  central  aspects  of  AI. 

Fig.  5  shows  pseudo-thice-dimensional  plots  of  the  v.-nsct  latency  disaibunon  for  three  other  cases. 
Fig.  5a  shows  the  spatial  distribution  for  #86-173.  As  car;  be  seen,  longer  onset  response  latencies  were 
typically  encountered  in  the  moie  dorsal  and  ventral  aspects  of  AI  (including  the  dorsal  aspect  of  All.  see 
Figure  lA).  Shorter  onset  latencies  were  observed  in  the  more  central  subregion  of  AI.  The  case  illustrated 
in  Fig  5b  differed  from  the  others  in  that  three  subregions  were  disclosed  containing  rclauvely  longer 
onset  latencies  and  two  subregions  containing  shoner  onset  latencies.  The  spatial  frci^ucncy  of  Icmgcr  and 
shorter  onset  latencies  tor  this  case  (i.e  .  spatial  distance  benveen  maxima  and  minima)  was  higher  than 
that  obs<:.rved  in  other  cases.  For  the  example  illustrated  in  Fig  5c  the  short  onset  latency  zone  occupied  a 
relatively  smaller  aiea  in  the  central  pexmon  of  cortex  than  lu  other  cases,  while  longer  latencies  were  found 
over  a  larger  and  more  ventral  zone 

A  comparison  ot  the  spaoai  distnbuaons  o?  onset  latencies  for  single  and  mulu-umfs  is  shown  ui 
Figs  6-8  These  spaiial  distributions  are  depicted  in  relation  to  the  map  derived  tor  integrated  excttalorv 
bandwidth  calculated  at  40  dB  abose  (hrtsbold  tor  HW4(>.  see  Schreiner  and  Sutter.  I‘i>d2.  Setter  and 
Schremer.  1994 1  In  order  to  compare  the  spatial  distnbuoons  of  onset  lateiKies  across  cases,  the 
minimum  in  the  BW4()  distnbution  o  c..  the  region  where  cells  were  most  shaiply  tuned  ai  40  dB  above 
ihrishold)  was  assigned  the  reference  value  of  0  mm  The  positions  of  dorsal  (negative  values)  and  ventral 
(positive  values)  recording  points  are  shown  relaove  to  this  0  mm  position  A  scancr  plot  for  single  unit 
(Fig.  6a)  and  mulu-unu  (Fig  bb»  latencies  calculated  ai  .V)  clB  above  threshold  is  shown  as  a  function  of 
their  dorsoventrai  iocauun  in  uSc  cortex  rclaiisc  to  BW4()  !n  general,  onset  latencies  tended  tc  be  shorter 
for  mulu-unii  responses  than  for  single  unit  responses  (Ml'  16  24±5  16.  SL'  19  Mi'^.()3)  In  addition, 
particularly  for  units  liKated  in  the  central  subfegiim  of  AI.  there  was  a  greater  scatter  of  onset  late«cies  for 
single  units  (Fig  fva)  a.%  compared  to  muiuple-units  (Fig  6bi  Howevei.  the  cA-crall  pancm  of  the  spatial 
distnbution  tor  single  and  multiple-unit  responses  was  similar  with  a  higho'  proportion  ot  shoner  onset 
latencies  located  in  the  central  subregton  of  AI  This  is  illustrated  m  the  bat  histogram  in  Fig  where 
variance  (indicated  ■.  the  enor  barsi  is  greater  tor  single  (Fig  'ai  than  for  multi -units  (Fig  7b). 


particulary  in  the  central  subregion  of  the  cortex.  Finally,  there  appeared  to  be  siPtller  variance  of  onset 
latency  responses  in  the  central  as  opposed  to  the  ventral  and  especially  the  dorsal  portions  of  the  cortex. 

Fig.  8  shows  the  spatial  distribution  of  the  minimum  latency  for  single  and  multiple-unit 
responses.  Minimum  latency  was  detcrmi,ncd  by  noting  the  stimulus  condition  that  elicited  the  shortest 
latency  response  independent  of  stimulus  intensity.  This  point  typically  corresponded  to  the  asymptote  of 
the  latcncy/utiensits  level  curve  As  with  the  latencies  obtained  at  30dB  above  threshold,  minimum  onset 
latencies  for  multi  urots  were  shorter  than  for  single  units  (MU:  13.09±3  88ms.  SU;  15.04±4.28ms). 

This  spaiial  distribution  resembles  the  one  derived  for  onset  latency  at  30  dB  above  threshold  widi  the 
excepuon  that  latencies  were  shorter  and  there  was  greater  variance  in  the  spaiial  distribution  calculated  for 
minimum  onset  response 

Repreirntaiion  Binaural  Response 

Fig  ^  illustrates  the  spaual  distributions  of  binaural  responses  for  the  four  cases  described  in  this 
paper  Fig  S»a  shows  an  enlarged  schematic  drawing  of  the  cortical  area  (shown  in  the  insert  on  the  right) 
invesugaied  in  case  oHb- 173  The  filled  circles  represent  those  locations  where  multi-unit  responses  were 
classilied  as  Kt.  the  open  cuvles  represent  the  locations  of  El  cells,  and  the  filled  triangles  indicate  EO 
cells  In  general.  EE  cells  were  typically  found  in  the  dorsal  subregion  of  AJ  In  the  rest  of  the  cortical  area 
exanuned.  altcmaung  bands  or  patches  of  EE  and  El  responses  were  observed.  Fig.  9b  is  derived  from  the 
dau  illustrated  in  Fig.  9a  and  is  redrawn  in  t’  t  form  of  a  contour  plot.  The  dashed  line  represents  the 
border  between  .Al  and  All  as  revealed  by  tlic  integrated  excitatory  bandwidth  measurements  (Schreiner 
arj  Cynadcr.  1984;  Schreiner  and  Mcndclson,  1990).  The  stippled  areas  represent  locations  w'hcre  EE  or 
EO  cells  were  encountered  and  the  white  or  blank  areas  represent  the  locations  of  El  cells.  For  this  case, 
ihc  spaual  representation  of  binaural  respon.ses  was  comprised  largely  of  clusters  or  patches  as  opposed  to 
alternating  elongated  bands  of  EE  and  El  units  contour  plots  of  binaural  responses  for  tlie  other  three 
cases  arc  illustrated  in  Fig.  9c  (#87-001),  9d  (#b/-518),  and  9e  (#87-706),  respectively.  Although  not  all 
bands  span  the  entire  mapped  regions,  in  these  latter  three  cases,  EE  and  El  responses  showed  a  much 


more  marked  tendency  to  orgarnze  themselves  into  alternating  elongated  bands  (Imig  and  Adrian.  1977; 
Middlebrooks  et  ai..  1980). 

Onsei  Latency'  versus  Binaural  Response 

An  attempt  was  made  to  correlate  the  spatial  distribution  of  binaural  response  type  with  onset 
latency.  Table  I  shows  the  average  on.sct  latency  response  of  the  binaural  classes  for  cacn  case.  Only  case 
#86- 173  contained  a  sufficient  number  of  EO  responses  to  be  included  in  the  analysis.  Fig.  10  shows  the 
relationship  betv  een  these  two  response  parameters  for  case  #87-518  (Fig.  iCa)  and  case;  #87-(X)l  (Fig. 
10b)  Contour  plots  have  been  generated  to  illustrate  this  point  nreore  clearly.  The  contour  plots  in  the  left 
panel  of  ei*ch  side  of  Figure  10  show  the  spadal  distribution  of  binaural  responite  while  the  plots  in  the 
right  panel  show  the  spatial  distribution  of  onset  latency.  The  dashed  line  represents  a  prominent 
isofrequcncy  contour  for  each  case  (viz..  10.0  and  18.0  kHz,  respectively).  The  shaded  areas  in  the 
binaural  response  contour  plots  demarcate  those  subregions  where  EE  or  EO  unit  clusters  were 
encountered  while  the  shaded  areas  in  the  onset  Utcncy  plots  illustrate  subregions  where  onset  response 
latencies  were  less  than  10.6  ms.  In  general,  these  figures  support  the  observation  that  tficre  is  little 
covanance  between  onset  latency  and  binaural  responses.  For  case  #87-(X)l  (Fig.  10b),  there  docs  appear 
to  be  some  dependency  of  onset  latency  response  on  binaural  response  type.  However,  as  revealed  by  the 
results  of  a  point-by-point  correlational  analysis  (Table  I)  this  observation  was  not  statistically  significant. 
A  significant  covariance  between  these  two  response  parameters  was  only  evident  for  one  of  the  four 
cases;  #87-706  (p  <  0.03). 

Onset  latency  versus  spectral,  intensity,  and  temporal  response  properties  obtained  at  the  same  conical 
locations 

In  an  attempt  to  further  our  understanding  of  the  functional  topography  of  AI,  the  relationship 
betv/een  onset  latency  and  response  parameters  previoulsy  described  (Schreiner  and  Mendelson,  1990; 
Schreiner  et  al.,  1992;  Mendelson  et  al.,  1993)  was  examined  in  the  present  study.  To  this  end,  statistical 
correlations  between  thest  response  parameters  were  analyzed  over  tire  entire  cortical  region  (AI)  in  the 


three  cases  for  which  complete  data  sets  were  obtained;  87-001,  87-518,  87-706.  In  addition,  statistical 
coirelations  were  also  detemiined  separately  for  cells  located  in  the  dorsal  (Aid)  and  the  ventral  subregions 
(Alv)  of  the  cortex.  Figure  1 1  illustrates  how  the  isofrequency  domain  was  divided  based  on  its  QlO-dB 
distribution.  The  central  ponion  of  AI  'vas  defined  as  that  region  that  contained  the  majority  of  the  sharply 
tuned  cortical  sites.  Aid  represents  recording  locations  in  that  region  of  the  cortex  dorsal  to  the  most 
sharply  tuned  responses  while  Alv  rcpreser's  those  units  located  ventral  to  it. 

Spectt  al  Bandwidth  Parameters  (Q iQd3'Q40dB>  ^roadbarui  transient  response).  In  a  series  of 

experiments  examining  the  functional  topography  of  auditoiy  cortex,  Schreiner  and  Mendelson  (19%) 
showed  that  overall  there  was  a  fairly  systematic  distribution  of  the  integrated  excitatory  bandwidth  along 
the  isoffequcncy  domain.  For  and  Q40£1b*  highest  values  for  both  were  found  in  the  central 

region  of  AI  while  locations  with  lower  Q  values  were  located  in  more  dorsal  and  ventral  subregions. 
Responses  to  broadbanj  transient  stimuli  were  also  systematically  distributed  across  the  cortex  such  that 
those  recording  sites  where  units  svctc  most  responsive  to  clicks  were  located  in  the  more  dorsal  and 
ventral  subregions  of  the  cortex,  coinciding  with  more  broadly  tuned  regions. 

The  correlation  of  integrated  excitatory  bandwidth  and  broadband  transient  responses  with  onset 
latencies  is  shown  in  Table  II.  Only  the  sutistically  significant  correlations  are  included.  As  can  be  seen, 
for  two  of  the  three  cases  (#87-106  and  #87-001)  there  was  a  strong  negative  correlation  between  onset 
latency  and  Qiqjb  ‘^‘-'^ss  the  entire  mapped  region  of  conca.  When  examined  by  cortical  subregion,  both 

of  these  cases  exhibited  a  statistically  significant  correlation  between  these  two  response  parameters  in  the 
dorsal  but  not  Uie  ventral  portion  of  the  cortex.  These  results  suggest  that  sharply  tune  cells  respond  with 
shorter  onset  latencies.  For  there  was  a  strong  negative  correlation  with  onset  latency  for  these 

same  two  cases.  However,  unlike  the  significam  correlation  with  04'^  was  obiervcd  across  the 

entu’e  region  of  AI  as  well  as  in  the  dorsal  subregion  for  #87-(X)l  while  it  was  only  statistically  significant 
iti  the  ventral  subregion  of  the  concx  for  #87- 106. 

For  onset  latency  ard  broadband  tran.sicnt  response  a  stnnstically  significant  negative  correlation 
was  observed  in  all  three  cases.  This  suggest  that  cells  whicn  respiond  vigorously  to  a  broadband  stimulus 
respond  with  longer  on.sct  latencies.  For  two  of  the  cases,  the  correlation  was  only  evident  when  the  entire 


extent  of  the  cortical  region  studied  was  considered.  For  the  third  case,  the  correlation  was  only  significant 
in  the  ventral  subregion  of  the  conex. 

Tone  Intensity  Parameters  (monotonicity,  SRL,  threshold,  dynamic  range).  Schreiner  et  al.  (1992) 
reported  that  unit  clusters  with  monotonic  raie/lcvel  functions  were  spatially  segregated  from  locations  with 
nonmonotonic  rate/level  functions.  Typically,  two  noncontiguous  areas  with  a  high  degree  of 
nonmonotonicity  were  observed:  one  was  located  at  the  dorso-vcntral  center  of  AI  and  a  second  in  the 
dorsal  third  cf  AI,  TTic  stimulus  levels  that  produced  the  highest  SRL  at  any  sampled  location  were  also 
found  to  be  systematically  distributed  across  the  cortex  such  that  a  region  of  low  SRLs  was  always  found 
in  the  central  ponion  of  AI.  The  spatial  distribution  of  contralateral  monaural  threshold  indicated  several 
segregated  areas  containing  clusters  with  either  higher  or  lower  response  thresholds.  Finally,  the  dynamic 
range,  defined  as  the  monotonic,  fast-growing,  low-level  portion  of  the  ratc/level  function  did  not  reveal  a 
clear  systematic  spatial  distribution  that  appeared  to  be  aligned  with  either  the  CF  gradiem;  or  the 
isofrequency  axis. 

The  analysis  of  onset  latency  and  tone  intensity  responses  yielded  several  significant  correlations. 
For  two  of  the  three  cases  (#87-518  and  87-(X)l),  there  was  a  significant  negative  conelation  between 
monotonicity  and  onset  latency  for  the  entire  coitcx  examined.  This  suggests  that  units  with  monotonic 
ratc/level  functions  tend  to  respond  with  shoncr  onset  latencies  and  that  units  with  high  degrees  of 
nonmonotonicity  respond  with  slightly  longer  latencies.  SRL  was  also  found  to  be  significantly  correlated 
with  onset  latency  for  two  of  the  three  cases.  For  one  of  the  cases  (#87-(X)l)  the  significant  correlation  was 
only  apparent  in  the  ventral  subregion  of  AI  while  for  the  second  case  (#87-706)  it  was  significant  for  the 
entire  extent  of  cortex  examined.  Finally,  onset  latency  was  significantly  correlated  winh  two  other  tone 
intensity  parameters:  contralateral  threshold  for  case  #87-706  and  dynamic  range  for  case  #87-001.  For 
both  of  these  correlations,  statistical  significance  was  only  observed  in  the  ventral  subregion  of  the  cortex. 

Temporal  Parameters  (FM  Sweeps).  Mendcison  ct  al,  (1993)  showed  dial  preferred  speed  of  FM 
sweep  appeared  to  be  systcmadcaily  distributed  along  the  dorsoventral  axis  of  area  AI.  In  the  dorsal 
subregion,  conical  cells  typically  preferred  fast  and/or  medium  FM  sweeps,  followed  more  ventrally  by 
cells  that  preferred  medium,  then  slow,  then  medium  speed  FM  sweeps.  In  the  more  ventral  aspect  of  AI 


neurons  generally  prefcned  fast  FM  sweeps.  Direction  selectivity  also  appeared  to  be  nonrandomly 
distributed  along  the  dorsoventral  axis  of  AI.  Units  that  prefen-ed  upward-directed  FM  sweeps  were 
typically  located  in  the  more  dorsal  and  ventral  aspects  of  AI  while  units  that  preferred  downwjird-diiected 
FM  sweeps  were  located  in  the  central  pordon  of  AI. 

Comparison  of  the  spatial  distribution  of  onset  latency  and  FM  sweep  resp>onses  revealed  that  there 
was  a  statistically  significant  correlation  between  latency  and  preferred  direction  of  FM  sweep.  This 
correlation  was  significant  for  the  entire  extent  of  AI  as  well  as  for  the  dorsal  subregion.  For  two  of  the 
cases  (#87-706  and  87  -{X)l),  there  was  a  positive  correlation  while  for  the  third  case  (#87-518)  there  was  a 
negative  correlation  between  these  response  parameters.  In  other  words,  for  two  of  the  cases  (#87-706  and 
87-001 ),  cells  prefeiring  upward-directed  FM  sweeps  tended  to  respond  with  longer  onset  latencies  while 
cells  preferring  downward-directed  sweeps  tended  to  respond  with  shoner  onset  latencies.  This 
relationship  was  reversed  for  case  #87-518. 

Binaural  response  type  versus  spectral,  intensity,  and  temporal  response  properties  obtained  at  the  same 
cortical  locations 

In  order  to  assess  potential  differences  between  the  three  types  of  binaural  responses,  an  analysis 
of  tlie  data  was  perf  ormed  across  all  three  of  the  more  extensively  mapped  cases  inespcctive  of  spatial 
location.  The  foanh  case  (#86-173)  is  not  considered  for  this  analysis,  since  only  three  out  of  ten 
parameters  were  available  for  this  case.  The  global  means  and  standard  deviation  for  these  three  cases  are 
provided  in  Table  III.  For  (his  analysis,  significant  differences  (p<0.05)  between  EE  and  El  neurons  were 
observed  for  6  of  the  10  tested  response  parameters:  Q40(1b,  broadband  transient  response,  minimum 

threshold.  SRL,  dynamic  range,  and  preferred  FM  speed.  El  neurons  tended  to  have  sharper  tuning, 
weaker  broadband  responses,  lower  thresholds  and  SRL  smaller  dynamic  range,  and  preferced  slower 
FM  speeds  tiian  EE  neurons  Furthermore,  some  differences  between  EO  responses  and  El  responses 
were  also  observed.  However,  the  small  number  of  EO  responses  for  these  three  animals  combined  (N  = 
15;  less  than  half  of  the  omitted  case)  may  be  re^nsible  for  the  lack  of  statistically  significant  differences 
between  EO  and  EE/'EI  for  most  of  the  10  parameters, 

1  8 


The  data  were  also  subjected  to  a  subregional  analysis  in  which  the  dorso-  ventral  axis  of  AJ  was 
divided  into  three  subregions;  the  dorsal,  central,  and  ventral  portions,  based  on  the  sharpness  of  tuning 
distribution  (see  also  Schreiner  and  Sutter,  1992).  The  analysis  of  differences  between  binaural  response 
types  was  then  performed  for  each  subregion  separately.  Table  IV  illusffates  the  results  of  the  binaural 

analysis  for  all  three  animals  based  on  the  three  subregions. 

Spectral  Bandwidth  Parameters  (Q jodB  Q40dB’  broadband  transient  response).  In  general,  there 

was  a  weak  relationship  between  the  spadal  distribution  of  binaural  response  type  and  those  related  to 
integrated  excitatory  bandwidth.  The  more  broadly  tuned  dorsal  subregion  of  A1  contained  tltc  largest 
number  of  EE  neurons  and  the  smallest  number  of  El  neurons  (61%  vs.  35%,  respectively).  In  the  sharply 
tuned  central  region  the  percentage  of  EE  and  El  units  was  eaual  (46%  vs.  47%).  El  responses  outnumber 
EE  responses  in  the  ventral  subregion  by  approximately  51%  to  40%.  EO  units  were  more  often 
encountered  in  the  central  and  ventral  subregions  than  in  the  dorsal  portion.  However,  due  to  the  small 
number  of  EO  responses  encountered  in  the  three  cases  under  consideration  EO  responses  were  omitted 
from  the  regional  analysis. 

For  all  binaural  response  types  combined,  the  QiojJB  OdOdB  dorsal  and  ventral 

subregions  of  AI  were  significantly  below  those  for  units  located  in  the  central  region,  as  can  be  seen  in 
Table  IV.  Tire  s,ame  subregional  differences  for  Qjq^j  and  could  also  be  seen  for  the  EE  units 

alone.  While  the  El  units  showed  the  same  trend,  statistically  I’gnificant  differences  were  only  observed 
between  the  central  and  ventral  subregions  for  QiQdjj  and  between  the  dorsal  and  central  subregions  for 
Q40dB-  ‘he  sharpness  of  tuning  for  the  El  units  appeared  to  be  slightly  higher  than  for  the  EE 

units,  although  statistical  significance  of  this  differevicc  was  only  observed  for  Q4()(Jb  m  the  ventral 
subregion.  This  finding  of  sharpncss-of-iuning  differences  between  the  binaural  classes  within  subregions 
of  AI  suggests  that  the  difference  seen  for  the  global  analysis  (Table  III)  is  not  due  to  differences  in  spatial 
disuibution  between  the  binaural  classes  but  rather  reflects  a  general  difference  in  their  fircquency 
resolution  that  is  largely  indepc-ndcni  of  spatial  location. 

Another  response  parameter  that  appeared  to  be  highly  correlated  with  binaural  response  type  was 
b^via  Iband  transient  response.  Tables  III  and  IV  indicate  that  the  broadband  transient  response  diffci'cd 


significar.tly  between  EE  and  £1  neurons.  The  global  analysis  (Table  III)  indicated  that  £E  neurons 
responded  more  vigorously  to  broadband  stimuli  than  did  El  cells.  This  was  corifinncd  in  the  regional 
analysis  (Table  IV)  for  the  central  and  ventral  subregions.  However,  in  the  dorsal  subregion  El  neurons 
appeared  to  respond  better  to  broadband  stimuli  although  the  difference  between  EE  and  El  neurons  did 
not  reach  statistical  significairce. 

An  analysis  of  intenegional  differences  between  the  bandwidth -related  measures  appeared  to 
indicate  that  the  parameters  encountered  in  the  central  region  differed  from  those  in  the  dorsal  and/or 
ventral  subregions  for  both  El  and  EE  responses.  Since  the  division  of  AI  into  three  subregions  was  based 
on  a  bandwidth  measure  (sec  Figure  1 1),  this  finding  is  to  be  expected. 

Tone  Intensity  Parameters  (monotonicity,  threshold,  SRL,  dynamic  range).  Across  the  entire 
mapped  extent  of  XL,  the  .monotonicity  of  EE,  El,  and  EO  units  revealed  no  statistcally  significant 
differences  (Table  III).  However,  El  units  displayed  a  tendency  to  be  more  nonmonotonic  than  either  EE 
or  EO  units  (Table  III).  In  the  global  three-case  analysis,  a  significant  difference  between  EE  and  El 
thresholds  was  observed  such  that  El  neurons  were  found  to  be  more  sensitive  (i.e.,  had  lower  thresholds) 
than  either  EE  or  EO  units  (Table  III).  Significant  differences  were  also  seen  for  the  SRL  responses  in 
which  lower  SRLs  were  obtained  for  El  units  as  compard  to  EE  or  EO  units.  With  respect  to  dynamic 
range,  EE  neurons  appeared  to  have  a  slightly  broader  range  than  those  of  El  neurons. 

Regional  analysis  revealed  that  El  neurons  were  more  nonmonotonic  than  EE  neurons  in  the 
central  and  ventral  subregions  of  AI.  This  difference  was  statistically  significant  in  the  ventral  portion  of 
AI  and  had  a  statistical  trend  (p<0.  i)  for  the  central  region.  A  significant  difference  between  the  EE  and  El 
unit  thresholds  was  seen  in  the  ventral  portion  of  AI  (Tabic  IV)  with  El  neurons  cxliibiting  lower 
thresholds  than  EE  neurons.  Differences  between  .El  and  EE  neurons  were  most  clearly  demonstrable  for 
the  SRL.  in  this  case,  all  three  subregions  showed  significant  differences,  with  the  SRL  of  El  units  being 
7  to  13  dB  below  those  of  the  EE  neurons.  While  the  dynamic  range  of  El  neurons  tended  to  be  narrower 
for  El  responses  than  for  EE  responses,  the  difference  did  not  reach  statistical  significance  in  any  of  the 
three  subregions. 


A  number  of  interregional  differences  between  the  intensity  parameter  responses  was  observed  for 
EE  and  El  classes  individually,  as  well  as  for  both  classes  combined.  For  rnonotoncity  and  threshold, 
these  differences  indicate  that  unit  responses  located  in  the  central  region  differed  from  those  located  in  the 
dorsal  and  ventral  subregions.  For  the  SRL  and  the  dynamic  range  measures,  the  dorsal  subregion 
appeared  to  be  different  from  the  central  and  ventral  subregions. 

Temporal  Parameters  (FM  parameters).  The  global  analysis  of  temporal  parameters  showed  only 
an  EE-EI  difference  in  the  preferred  FM  speed  with  El  neiarons  preferring  slightly  slower  speeds  than  EE 
neurons.  In  the  regional  analysis,  this  finding  was  confined  to  the  central  portion  of  AI,  although  the 
ventral  subregion  showed  the  same  tendency.  There  were  no  significant  differences  between  binaural 
response  type  for  either  latency  or  preferred  FM  direction. 

Interregional  differences  were  apparent  for  all  temporal  parameter  responses.  The  onset  latencies  in 
the  central  region  were  the  shortest,  although  this  was  only  statistically  significant  in  the  four-case 
analysis.  Preferred  FM  speed  obsei-ved  in  the  centra!  subregion  differed  significantly  from  that  in  the 
ventral  subregion  for  EE  and  El  cells  and  for  both  binaural  response  types  combined.  When  both  types  arc 
considered  together,  tlie  dorsal  subregion  also  appeared  to  differ  from  those  in  the  central  subregion.  Foi' 
EE  units  in  isolation  or  combined  with  El  cells,  the  preferred  FM  direction  of  units  in  dorsal  AI  differed 
from  those  encountered  in  the  central  and  ventral  subregions  of  AI. 

In  summary,  distinct  differences  in  several  response  properties  of  El  and  EE  units  were  observed 
throughout  the  full  extent  of  the  isofrequency  domain  of  AI  as  well  as  within  .subregions  of  AI.  Generally, 
El  neurons  tended  to  exhibit  lower  response  thresholds,  lower  SRLs,  a  narrower  dyiiamic  range,  a  higher 
degree  of  nonmonotonicity,  sharper  tuning,  weaker  responses  to  broadband  stimuli,  and  preferred  slightly 
slower  FM  sw^ceps  than  EE  neurons.  This  suggests  that  there  are  stronger  monaural,  contralateral 
inhibitory  influences  acting  upon  El  neurons  than  upon  EE  neurons. 

Regional  differences  in  monaural  responses  between  the  dorsal,  central,  and  ventral  portions  of  AI 
appear  to  be  more  pronounced  for  EE  units  than  for  El  units.  This  is  reflected  in  the  fact  that  1 1  of  30 
regional  comparisons  showed  statistically  significant  differences  for  EE  neurons  whereas  only  6  of  30 
such  comparisons  yielded  a  statistically  significant  difference  for  El  neurons.  For  both  binaural  classes 
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combined,  15  of  30  comparisons  revealed  significant  regional  differences,  confirming  the  view  of 
systematic  functional  segregarions  in  the  isofrequency  domain  of  AI. 

Discussion 

The  purpose  of  the  present  paper  was  to  determine  and  compare  the  spatial  distribution  in  AI  of 
onset  latency  and  binaural  response  type  and  compare  them  to  the  10  paranjcters  previously  described  in 
this  series  of  reports  (Schreiner  and  Mcndelson,  1990;  Schreiner  et  al.,  1992;  Mendeison  et  al.,  1993). 
Similar  to  what  was  observed  in  most  other  parameters,  onset  latency  responses  were  distributed 
nonrandomly  across  the  conex  such  that  cells  with  longer  onset  latencies  were  typically  located  in  the 
dorsal  and  ventral  aspects  of  AI  while  those  with  shorter  latencies  were  more  often  found  in  the  central 
region  of  the  dorsal -ventral  extent  of  AI.  In  concurrence  with  earlier  reports  (Imig  and  Adrian,  1977; 
Middlebrooks  et  al.,  1980),  the  present  study  also  showed  that  near  threshold  binaural  responses  appeared 
to  be  dismbuted  as  alternating  elongated  bands  or  patches  of  EE  and  El  cells.  While  the  overall  spatial 
distributions  of  onset  latency  and  binaural  responses  were  similar  in  most  instances,  the  details  of  these 
spatial  patterns  varied  considerably  across  cases. 

Before  discussing  these  results  further,  the  issue  of  multi-  versus  single  unit  recording  should  be 
addressed.  As  this  issue  has  been  discussed  in  detail  elsewhere  (Schreiner  and  Mcndelson,  1990; 
Schreiner  et  al.  1992;  Mendeison  et  al  1993;  Schreiner  and  Sutter,  1992;  Sutter  and  Schreiner,  1995),  it 
will  be  treated  summarily  here.  Since  multi-unit  recording  samples  the  electrical  activity  of  several  neurons 
within  a  small  volume,  it  cannot  be  assumed  that  all  neurons  involved  have  identical  response  properties, 
nor  can  the  potential  contribution  of  thalamocortical  afferents  impinging  on  postsynaptic  cortical  cells  be 
excluded  (Schreiner  and  Cynader,  1984;  Schreiner  and  Mcndelson,  1990).  Thus,  both  multi-unit  onset 
latency  and  binaural  response  recorded  at  a  given  cortical  site  most  likely  reflect  the  dominant  pattern  of 
neural  activity  resulting  from  local  integrated  cortical  processing  at  that  location.  Moreover,  the  systematic 
variation  in  multi-unit  onset  latencies  and  binaural  responses  observed  along  the  dorsoventral  extent  of  AI 
is  more  easily  explained  as  the  consequence  of  intrinsically  distributed  cortical  processes  rather  than  as  the 
result  of  random  variation  in  multi-unit  responses. 


In  order  to  diminish  the  potential  effects  that  single  versus  multi-unit  recordings  might  impose  on 
the  results,  comparisons  of  single  and  multi-unit  onset  latencies  were  made  as  a  function  of  dorso-ventral 
location.  In  general,  it  appeared  that  the  onset  latencies  w'ere  shorter  for  the  multi-units  than  for  single 
units.  However,  upon  closer  inspection,  it  was  apparent  that  there  was  a  greater  scatter  among  the  .spatial 
distribution  of  onset  latencies  for  single  nnits  than  for  multi-units.  This  increase  in  scatter  of  single  unit 
responses  versus  multiple-unit  responses  has  been  observed  in  CF,  Q4CldB’  threshold  and 

monotonicity  measures,  particularly  in  the  ventral  region  of  AI  (Schreiner  and  Sutter,  1992;.  Sutter  and 
Schreiner,  1994).  Despite  the  small  difference  in  onset  latency  the  overall  spatial  distribution  for  single  and 
multi-unit  onset  latencies  was  similar. 

Onset  Latency 

The  average  onset  latencies  recorded  in  tlte  present  study  are  characteristic  of,  and  closely  resemble 
those  obtained  in  single  unit  studies  (Phillips  and  Irvine,  1981;  Phillips  et  al.  1985).  As  obseived  by  these 
researchers,  as  stimulus  intensity  was  increased,  onset  latency  became  shorter. 

As  mentioned  above,  f  comparison  between  single  and  multi-unit  onset  latencies  revealed  that 
multi-units  responded  with  shorter  latencies.  This  is  most  likely  due  to  tlie  fact  that  a  jpven  iniu  cluster  will 
consist  of  individual  cells  cxilubiting  both  short  and  long  latencies.  Thus,  when  recording  from  a  unit 
cluster,  neurons  displaying  shoner  latencies  wih  bias  the  aggregate  response  and,  thereby,  cause  the 
cluster  to  be  assigned  a  correspondingly  .?honer  latency  value. 

The  present  study  also  revealed  that  onset  latencies  were  not  randomly  distributed  in  AI.  In 
general,  units  with  shorter  onset  latencies  were  located  in  the  central  region  of  AI  while  units  witli  longer 
onset  latencies  were  typically  found  in  the  more  dorsal  and  ventral  subregions  of  the  cortex.  T)ic  spatial 
transitions  from  shorter  to  longer  latencies  and  vice  versa  were  sometimes  gradual  (e.g..  Fig.  4;  #87-001) 
and  other  times  quite  steep  (e.g..  Fig.  5b;  #87-518).  In  addition,  there  was  some  regional  variability  in 
onset  latency  values,  particularly  in  the  venffaJ.  portion  of  the  mapped  areas.  Several  investigators  have 
recently  reported  a  higher  degree  of  variability  of  vencrally  located  A,I  units  in  respon,se  to  other  stimulus 
parameters  (Reale  and  Kettner,  1986;  Imig  et  al.  1990’  Schreiner  ani  Mendelson,  1990;  Schreiner  et  al., 
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1992;  Mcndclson  et  ai.,  1993).  It  has  been  suggested  that  cells  located  in  the  ventral  portion  of  AI  may  be 
organizadonaiiy  or  functionally  distinct  from  those  located  in  the  central  and  dorsal  regions  of  the  conex 
(Reaie  and  Kettner,  1986;  Imig  et  al.  1990;  Schreiner  and  Mcndelson,  1990;  Schreiner  ct  al.,  1992; 
Mendelson  et  al.,  1993;  Schreiner  a.nd  Sutter,  1992).  The  greater  variability  in  onset  latency  observed  in 
the  present  study  provides  additional  support  for  this  suggestion.  The  spatial  extent  and  location  of 
aggregates  with  similax  latencies  appeared  to  vary  from  animal  to  animal  as  well  as  with  position  within  the 
isofrequency  domain  of  AI.  This  intercase  variability  was  also  observed  in  the  spatial  distribution  of  other 
response  properties  in  this  series  of  experiments  (Schreiner  and  Mendelson,  1990;  Schreiner  et  al.,  1992; 
Mendelson  et  al.,  1993). 

The  observation  that  onset  latencies  appeared  to  show  a  large  scatter  in  the  spatial  distribution  and 
varied  as  a  function  of  dorsoventral  location  suggests  that  latency  reflects  mechanisms  other  than  travel 
time  differences  along  die  basilar  membrane.  One  possible  contributing  factor  to  longer  onset  latencies 
observed  in  the  dorsal  region  of  AI,  is  whether  the  unit  has  a  multi-  or  single  peaked  response  profile. 
Sutter  and  Schreiner  (1991)  found  that  m  general,  neurons  with  multi-peaked  profiles  respond  with  longer 
latencies  than  single-peaked  neurons.  These  researchers  also  observed  that  the  niajority  of  multi-peaked 
neurons  were  located  in  the  dorsal  region  of  AI.  Thus,  the  presence  of  multi-peaked  neurons  may  help  to 
account  for  the  longer  onset  latency  responses  observed  in  the  dorsal  region  of  AI.  The  longer  latencies 
that  we  observed  in  the  ventral  region  are  most  likely  due  to  some  other  neural  mechanism  such  a.s  an 
increase  in  scatter  of  CF  (Sclueiner  and  Sutter,  1992),  stronger  inhibitory  influences,  or  inputs  from  areas 
with  longer  latencies. 

The  large  local  scatter  of  onset  latencies  found  for  muld-  and  single  units  in  Al  is  similar  to 
observations  made  for  CF  (Schreiner  and  Sutter,  1992),  threshold,  and  nonmonotonicity  (Sutter  and 
Schreiner,  1994).  This  supports  the  hypothesis  that  the  local  processing  networks  in  AI  can  draw 
information  from  widely  distributed  parameter  ranges  in  the  spectral  (frequency,  bandwidth  and  amplitude) 
and  temporal  (latency,  FM)  domain.  The  advantage  of  the  richtiess  of  these  locally  computed  parameters 
supcrm:posed  on  more  global  gradients,  may  be  related  to  the  fermation  of  U'anscortical  neural  assemblies 
involved  in  the  disrrib’ited  encoding  of  spectrally  and  temporally  complex  signals. 
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The  orderly,  global  spatial  distribution  of  onset  latencies  observed  in  the  present  study  does  not 
appear  to  be  unique  to  area  AI.  Schreiner  and  Langner  (1988)  obseived  that  onset  latencies  were 
systematically  distributed  in  the  central  nucleus  of  the  inferior  colliculus  (ICC)  of  tire  cat.  However,  one 
potentially  imponant  difference  between  the  spatial  distribution  of  onset  latencies  in  thes^  two  auditory 
structures  is  that  latency  appears  to  be  largely  independent  of  CF  in  AI,  at  least  over  the  CF  range  tested  in 
this  study,  while  in  ICC  the  gradient  of  onset  latencies  appears  to  be  only  weakly  correlated  with  Cl", 
suggesting  that  the  travelling  wave  mochanism  may  be  contributing  more  to  the  latency  In  ICC  than  to  the 
one  observed  in  AI  (Langner  et  al.  1987). 

A  systematic  distribution  of  onset  latencies  has  also  been  observed  in  other  species  such  as  the 
domestic  chick,  Heil  and  Scheich  (1991)  found  that  not  only  did  onset  latencies  decrease  with  ?ai  increase 
in  frequency,  but  that  they  were  also  topographically  distributed  within  Held  L,  the  avian  homologue  of  the 
auditory  cortex. 

Binaural  Response  Type 

In  the  present  study,  alternating  elongated  patches  or  bands  of  EE  and  El  cells  were  distributed 
ortliogonally  to  CF,  This  pattern  of  spatial  distribution  is  consistent  with  observations  report.ed  by  other 
investigators  (Imig  and  Adrian,  1977;  Imig  and  Brugge,  1978;  Middlebrcoks  el  al.,  1980;  Schreiner  and 
Cynader,  1984;  Kelly  and  Judge,  1994).  More  recently.  Reale  and  Kettner  (1986),  found  that  in  the 
ventral  region  of  AI  a  somewhat  different  pattern  of  binaural  responses  emerged  when  tested  with 
interaural  intensity  disparity  (HD)  stimuli  as  opposed  to  the  more  traditional  binaural  equal  intensity 
stimuli.  With  UDs,  they  found  a  disproportionate  number  of  cells  in  the  ventral  .subregion  that  exthibited 
both  facilitatory  and  inhibitory  response  propenics  when  tested  with  IIDs  as  compared  to  equal  intensity 
stimuli.  This  segregation  in  binaural  responses  between  tlie  ventral  subregion  and  the  rest  of  die  cortex 
lends  further  support  to  the  notion  tliat  there  may  be  a  functional  partition  between  cells  located  in  the 
ventral  portion  of  AI  as  compared  to  those  located  mom  centrally  or  dorsally. 

Anatomical  suppon  for  the  segregation  of  binaural  resopnse  types  can  be  derived  from  earlier 
reports  which  have  shown  that  the  thalamic  projections  to  Ai  are  topographically  organized  (Anderson  et 
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al..  1980;  Mcrzcnich  ct  al.,  1982;  MiddIelMt)oks  and  Zook,  1983;  Morel  and  imig,  1987).  Middlebrooks 
and  Zook  (1983)  showed  tliat  diffenrnt  El  bands  receive  input  from  coincident  thalamic  populations  while 
different  EE  bands  derive  input  from  different  subdivisions  of  a  continuous  EE-projecting  rone.  However, 
Brandner  and  Rcdics  (1990)  have  tecendy  suggested  that  cordcal  EE  and  El  bands  do  no;  receive 
coincident  thalamic  projections.  Instead,  their  data  suggest  that  different  subregions  of  a  cortical 
isofrcquency  contour  may  receive  inputs  from  diffcicnt  portions  of  the  corresponding  tiialamic 
isofrcquency  contour.  Brandner  and  Redies  (1990)  reponed  that  the  rostral  part  of  the  ventral  nucleus  of 
the  medial  geniculate  body  (MGB)  projects  to  neurons  located  in  the  dorsal  region  of  the  cortex  while  the 
caudal  region  of  the  MGB  projects  to  more  ventrally  located  cortical  cells.  These  geniculate  projections 
appear  to  be  independent  of  binaural  response  properties  and  may  provide  .some  insight  into  the  functional 
gradients  that  have  been  observed  in  the  isofrcquency  domain  (Schreiner  and  Mendelson,  1990;  Schreiner 
et  al.,  1992;  Mendelson  ct  al.,  1993;  Heil  et  al.,  1992).  However,  a  more  thorough  re-evaluation  of  the 
thalamocortical  projection  patterns  is  now  required  in  light  of  the  functional  distinctions  which  are  apparent 
between  different  subregions  of  Al. 


Comparison  of  the  Spatial  Distribution  of  Response  Properties  in  Al 

It  is  well  known  that  the  conex  is  tonoiopically  organized  and  that  alternating  patches  of  EE  and  El 
cells  are  distributed  orthogonally  to  CF  (Rose  and  Woolsey,  1948;  Mcrzcnich  et  al.,  1975).  Recently, 
several  investigators  have  reported  that  a  number  of  different  response  parameters  appear  to  be  organized 
in  an  orderly  manner  along  the  isofrcquency  axis  of  AL  For  example,  there  have  been  reports  which 
suggest  that  a  subset  of  sound  localization  response  propierties  may  be  spatially  distributed  within  .41 


(Miuulebrooks  and  Pettigrew,  1981;  Jenkins  and  Mcrzenich,  1984;  Rcalc  and  Kctcner,  1986;  Imig  ct  al. 


1990;  Rajan  et  al,  1990).  In  addition,  th.c  spatial  distributions  of  other  stimulus  response  parameters,  such 
as  integrated  excitatory  bandwidth,  bro.adband  transient  response,  FM  sweep  direction  and  speed 
selectivity,  stimulus  intensity,  inhibitory  sidebands,  and  multipeakcd  responses,  have  also  been  related  to 
the  isofrcquency  domain  of  Al  (Schreiner  and  Mendelson,  1990;  Sutter  and  Schreiner,  1991;  Schreiner 
and  Sutter,  1992;  Heil  et  al.  1992;  Schreiner  et  al.,  1992;  Mendelson  et  al.  1993;  Shamma  et  al.,  1993). 
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For  ihc  purposes  ui  the  present  discussion  only  those  spatial  distiibunons  obtained  from  'he  currciii 
cxperinicntal  senes  that  arc  significantly  coirclared  ssith  Intcncy  or  luru^aral  responses  wiH  be  discussed.  A 
global  correlational  analysis  of  all  ivvelvc  response  parameters  studied  will  be  provided  ui  :hc  fiiu'  report  in 


this  senes. 


Onset  latency  verini  spectral,  intensity .  and  temporal  response  properties  rt  corded  at  the  single  cortical 
locations 

Spectral  Bandwidsh.Paratncien.  In  the  present  study  a  significant  correlation  between  onset  latency 
and  QiOdB  found  for  two  of  the  three  cases.  This  finding  suggests  tiiat  narrowly  tuned  cells  respond 

with  shorter  onset  latencies  whereas  more  broadly  tuned  cells  ( it  least  for  broadly  tuned  cells  located  in  the 
dorsal  region  of  AI)  may  lespond  with  longer  latencies.  This  is  consistent  with  results  reposted  by  Kelly  ct 
al  (1986)  who  found  that  broadly  tuned  cells  in  ferret  auditory  cortex  had  minimum  first  spike  latencies 
that  were  5- 10  ms  longer  than  more  narrowly  tuned  cells. 

A  similar  finding  was  observed  for  these  same  two  cases  described  in  the  preceding  paragraph 
when  onset  latencies  were  correlated  with  Q40(1B-  ^  (f^87’518)  for  which  these  correlations 

could  be  analyzed  (and  which  were  found  to  be  nonsigniflcam)  conuuned  two  regions  in  which  units 
responded  with  shorter  onset  latencies  and  three  regions  where  unit',  displayed  longer  onset  latencies  The 
presence  of  these  additional  regions  of  shorter  and  longer  on»;t  latencies  may  have  contributed  to  the 
nonsignificant  correlation  found  between  these  tw'o  response  properties  for  this  case.  Despite  the 
significant  statistical  correlations  between  onset  latency  and  QjodB  Q40dB'  functional  relationship 


between  the  integrated  excitatory  baridwidth  and  onset  latency  is  not  immediately  obvious. 
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all  three  cases  studied.  When  a  comparison  of  the  spatial  distribution  oi  these  two  response  parameters 
was  made  (Fig.  6;  see  also  Schreiner  and  Mendelson  1990,  Fig.  15)  it  became  clear  that  those  cortical 
regions  containing  cells  responding  most  vigorously  to  broadband  trans.icnts  coincided  with  those  regions 
containing  cells  exliibiting  longer  onset  latencies. 


Tone  Iniensin  Parameters.  Onset  latency  was  found  to  be  sigiiiticantly  cov^Hslatcd  with 
monoiomaty  fu  two  oi  the  three  cases  <sxatruned.  Tliis  is  not  surprifing  given  that  monotonicity  is 
significantly  i  orrclatcd  with  Q4(vjiB-  As  mendoned  above,  cells  with  high  Q40(j3  values  were  found  to 
respond  with  shorter  latencies  than  cells  with  iowet  Q40iB  values.  TIjs  sugf^e'’ts  thai  cells  with  high 

degrees  of  nonmonotojiicity  tend  to  have  slightly  shorter  onset  latencies.  This  observation  stands  in 
conu-ast  to  the  results  of  Phillips  ct  al.  (5985)  who  found  that  nonmonotonic  cells  had  significantly  longer 
mean  onset  latencies  than  monotonic  cells.  One  possible  explanation  fc:  the  apparent  discrepancy  between 
our  results  and  those  of  Phillips  ct  ai.  (1985)  may  be  due  to  the  fact  that  we  adopted  a  different  measure  of 
monotonicity.  We  used  a  continuous  scale  for  the  assessment  of  monotonicity,  namely  the  slope  of  flic 
high  level  segment  of  the  ratc/levci  function,  as  compared  to  Phillips  and  colleagues  who  used  a  discrete 
two-category  classification  sclicmc.  Consequently,  our  measure  resulted  in  a  a  larger  number  of  cells  being 
classified  as  nonmonotonic  than  reponed  by  Phillips  and  colleagues.  Thus,  the  larger  number  of 

nonmonotonic  cells  may  yield  different  results  with  regard  to  onset  latency.  In  addition,  although  it  is  true 
that  many  nonmonotonic  neurons  also  have  high  vaiues,  this  relationship  docs  not  hold  for  all  AI 

neurons.  Particularly  in  the  dorsal  and  ventral  subregions,  fairly  broadly  tuned  nonmonotonic  neurons 
could  be  encountered.  Many  of  these  more  broadly  tuned  nonmonotonic  units  located  in  the  dorsal  legion 
of  AI  may  also  have  responded  with  shortei  latencies.  Additionally  not  every  sharply  tuned  neuron  was 
nonmonotonic,  i.e.,  of  two  neurons  with  equally  narrow  bandwidth  which  differ  only  in  monotoncity,  tlic 
one  with  the  higher  nonmonotonicity  likely  has  the  longer  latency,  Therefore,  it  can  be  concluded  that  the 
overall  U-shaped  distribution  of  onset  'atencies  is  in  register  with  the  distribution  of  sharpness  of  tuning. 

In  addition,  this  distribution  is  locally  modulated  by  influences  related  to  the  monotonicity  of  the  response 
properties  with  more  nonmonotonic  neurons  having  slightly  longer  onset  latencies. 

Temporal  Parameters.  The  present  results  showed  that  there  was  a  significant  correlation  between 
onset  latency  and  preferred  direction  of  FM  sweep.  For  two  of  the  three  eases,  cells  preferring  downward- 
directed  sweeps  tended  to  respond  with  shorter  onset  latencies.  Since  downward  sweeps  start  at  the  basal, 
shon  latency  end  of  the  basilar  membrane,  it  is  conceivable  that  the  observ'eu  correlation  is  a  reflection  of 
that  circumstance.  However,  the  functional  implications  of  this  finding  are,  at  present,  unclear  and  may 


reflec*  more  general  (and  presently  not  well  unJcrstooa)  tcmpor-i'  processing  mechanisms  governing  FM 
sweep  direction  selectivity  in  Al. 

B'naura!  response  typn  versus  spectral,  intensity,  and  temporal  response  properties  obtaiiedat  s.he  same 
cortical  locations 

Before  discussing  these  rcselts,  a  potential  difficulty  with  relating  binaural  response  type  to  othc) 
response  parameters  within  each  case  must  be  addressed.  In  oiricr  to  accurately  assess  the  functional 
relationship  between  binrural  responses  and  other  response  parameters,  the  preferred  mcUiod  of  analysis 
would  be  to  correlate  a  continuous  measure  of  binaurality  with  the  other  continuously  distributed  response 
parameters.  However,  unliJee  the  other  response  properties  which  result  in  a  continuum  of  responses  (c.g., 
for  Qio-dB  there  is  a  gradual  progression  from  narrowly  tuned  in  the  central  regions  to  broadly 

tuned  in  the  ventral  and  dorsal  regions  of  the  cortex),  bbiaural  responses  as  identified  in  this  (and  typically 
in  other  studies)  study  form  into  di.scrcte  clusters:  a  given  unit  at  a  given  cortical  site  is  either  EE,  El.  or 
EO.  Although  the  correlation  between  a  continuous  variable  and  a  discrete  variable  can  provide  certain 
insights  into  their  relationship,  we  restricted  ourselves  here  to  a  comparison  of  the  parameter  distributions 
for  the  three  non-continuously  distributed  binaural  rcsi>onsc  classes. 

Several  investigators  (eg.  Mcrzeiiich  ct  al.,  1982;  Middlebrooks  and  Zoiok,  1983)  have  suggested 
that  contiguous  pairs  of  EE  and  El  bands  may  form  functional  units  within  Al.  This  argument  suggests 
that  pairs  of  EE/EI  bands  should  be  analy^^  separately  from  other  adjoining  pairs  with  regard  to  the  other 
1 1  response  properties  examined  in  our  experiments.  However,  there  is  not  yet  sufficient  evidence  to 
support  the  hypothesis  that  u  contiguous  pair  of  ES/EI  cortical  bands  is  indeed  a  functional  unit.  Adopting 
this  assumption  p  rctiiuturc  ly  would  also  not  conform  well  to  this  type  of  analysis  because  the  absolute 
number  of  cells  encountered  within  a  pair  of  bands  may  be  quite  small  for  some  ammals,  as  was  the  case 
in  the  present  study,  and,  therefore,  may  not  be  suited  for  the  kind  of  global  statistical  analysis  employed 
here.  Instead,  we  opted  for  combining  the  binaural  analysis  for  four  extensively  mapped  cases  in  order  to 
increase  the  N  and  thereby  strengthen  the  analysis.  However,  since  the  global  picture  of  binaural  band 
shape  and  distribution  is  quite  different  from  animal  to  animal,  wc  did  not  aiialyzc  the  data  band-  by-band 


but  rather  across  all  of  A1  or  separately  for  rhe  oorsal,  central,  and  ventral  subregions  of  AI.  AI  was 
div  ided  inic  the  three  subregions  based  ca  the  dissnbuiion  i-f  sharpness  of  tuning  of  multi-unit  responses. 
A  similar  division  ha  ;  prcviou.viy  been  successfully  e.nployed  (Sutter  and  Schreiner,  1991,  1995; 
Schreiner  and  Sutter.  1992).  The  subregional  analysis  is  necessary  in  order  *o  distinguish  be  ween  global 
differences  between  EE  and  El  neurons  which  are  based  jn  differences  in  their  spatial  distribution  relative 
to  some  empirically  determined  response  ^rad’cnt  (e.g.  sharpness  of  tuning)  and  on  tho.se  differences  that 
are  also  locally  distvmable  and  that  may  correspond  to  differences  in  cell  morphology  and  connectivity. 

Finally,  the  inicrpreution  of  the  observed  differences,  or  lack  thereof,  between  the  binaural 
response  types  is  complicated  by  tV:  o  factors.  First,  a  single  naramc^er  cati  vary  over  a  ccnsidcrable  range 
across  the  isofrequency  domain  of  AI  as  demonstrated  by  the  nonuniform  spatial  distribution  for  most  of 
the  investigated  parameters.  Consequently,  a  large  variance  due  to  global  spatial  variations  may  obscure 
local  differences  between  binaural  response  types  in  regions  with  much  smaller  local  variance  of  a  given 
oi'ramctcr.  Second,  the  nonuniform  spatial  distribution  of  the  response  patumeters  may  covary  with  spatial 
variation.^  of  binaural  response  classes,  so  that  global  differences  between  parameters  are  due  to  biases  in 
the  spatial  distribution  of  the  binaural  responses.  For  otarnplc,  r’.c  significantly  larger  values  of  (240dB 

El  neurons  across  three  of  the  oases  (sec  Table  HI)  may  be  due  to  the  fact  that  more  El  responses  are 
located  in  the  sharply  tuned  regions  of  AI  and  more  EE  responses  in  the  less  sharply  tuned  regions  of  AI, 
while  differences  between  FE  and  El  neurons  in  cither  the  sharply  or  the  broadly  tuned  regions  may 
actually  be  negligible.  It  is  difficult  to  avoid  any  biases  from  the  spatial  distributions  of  the  various 
parameters,  since  the  location  and  identity  cf  different  binaiwal  aggregates  is  not  uniform  from  one  animal 
to  the  next  (see  Figure  9).  Therefore,  we  could  not  perform  an  analysis  separately  for  different  Vi  Cll 
defined  binaural  regions  such  as,  for  example,  a.  B,  or  yEI  batids  (Imig  and  Brugge,  1978). 

Spectral  Bandwidth  Parameters.  In  the  global  analysis  we  observed  that  040^3  values  for  El  cells 

were  slightly  higher  than  values  for  EE  cells  suggesting  that  El  cells  are  more  sharply  tuned  than  EE  cells. 
A  trend  similar  to  that  seen  for  040^3  was  observed  for  QiOci3  values,  although  it  did  not  reach  statistical 

significance.  In  the  regional  analysis,  the  difference  in  sharpnc.vs  of  tuning  between  El  and  EE  was  most 
clearly  expressed  in  ventral  AI.  Interregional  differences  in  accordance  with  previously  described 
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differences  heween  dorsal,  central  and  ventral  A!  were  seen  for  both  and  Q40(i3  for  EE  as  well  as 

Ei  ncurotis. 

Another  parameter  closely  related  to  the  breadth  of  the  tuning  curve  profile  is  the  re:''onse  strength 
to  broadband  transient  stimuli  (see  Scl^eincr  and  Mcndelson,  i99C;.  As  was  observed  with  040^3,  “ 

clear  difference  in  the  ’•esponse  strength  of  the  Et  versus  the  HE  units  was  apparent  with  El  units  e>:hibiting 
weaker  responses  globally  as  well  as  vcntrally  and  centrally.  In  contrast,  Ei  units  in  dorsal  AI  exliibited 
stronger  responses  to  broadband  sumuli  than  did  EE  units. 

Tone  Intensity  Parameters.  All  foui  pai’amctcrs  that  assess  the  response  of  crruc-il  neurons  to  tones 
of  varying  intensity  levels  revealed  diffeienccs  between  EE  and  EI  sites  both  in  uie  global  and  the  regional 
analysis.  The  most  consistent  differences  were  seen  for  the  SRI-  where  the  EE  units  showed  reladvely 
higher  values  in  each  region.  A  similar  trend  was  seen  for  response  thresholds,  suggesting  that  EI  cells 
may  have  lower  contralateral  monaural  thresholds  that  arc  at  least  5  dB  lower  than  EE  ccUt:.  Schreiner  and 
Cynader  ( 1984)  reported  a  similar  result  in  their  investij’ation  of  multi-units  in  AI  and  All.  This 
relationship  is  perhaps  not  tco  surprising  when  one  considers  that  bctli  binaural  responses  and  threshold 
responses  of  high  and  low  threshold  aggregates  were  scattered  thrctjghout  the  etidic  extent  of  the  mapped 
area.  The  functional  significance  of  this  relationship  awaits  fitriher  investigation. 

Monotonicity  values  for  EI  units  located  m  the  central  and  ventral  subregions  as  well  as  across  the 
entire  cortical  region  exairiincd,  were  generally  lower  than  those  of  EE  units.  This  suggests  that  EI  cells  are 
more  nonmonotonic  than  EE  cells.  This  is  in  contrast  to  Phillips  ct  al  ( 1985)  who  reported  that  while 
monotonic  and  nonmonotonic  ceils  were  segregated  along  the  dorsoventral  extent  of  AI,  they  did  not 
appefir  to  coincide  with  any  particular  binaural  response  type.  In  other  words,  they  found  tiiat  the  spatial 
distribution  of  monotonicity  appeared  to  be  independent  of  binaural  response,  As  mentioned  above,  we 
used  a  different  mctliod  for  assessing  monotonicity  from  that  used  by  Phillips  and  colleagues  (1985), 
namely  the  slope  of  the  high  level  segment  of  raic/lcvel  functions.  Thus,  our  measure  of  monotoniciry  may 
be  more  scn.sitive  to  the  finer  differences  that  undoubtedly  exist  between  binaural  rc,sponsc  types.  Finally, 
the  observed  differences  between  .EE  and  EI  units  with  regard  to  monotonicity  should  be  iuteipretcd  with 
some  caution  since  Lhey  may  strongly  reflect  cluster  composition  effects.  Recently,  Sutter  and  Schreiner 


29 


(1995)  shcv/cd  thaf  tiic  predictive  value  of  multi-unit  monotonicity  measures  for  the  underlying  single  uni: 
propcnies  is  limited. 

Temporal  Pnrameiers.  With  regard  to  FM  sweep  rc'jponscs.  binaura!  response  types  were 
correlated  differently  with  preferred  speed  of  FM  sweeps.  EE  cells  preferred  faster  FM  sweeps  than  did  El 
cells.  This  relationship  held  in  botf  the  global  analysis  as  well  as  for  an  analysis  restiicted  to  the  central 
region  of  Al. 

Collectively,  seven  of  the  twelve  monaural  response  paiameters  investigated  in  this  series  of 
experiments  displayed  statistically  significant  differences  between  the  two  main  binaural  response  types, 
EE  and  El,  eith'V;r  in  the  global  and/or  in  the  regionrJ  analysis.  The  resp^-nse  paiameters  that  did  not  exhibit 
significant  differences  were  QiqqBi  latency.  prefcriTd  FM  direction,  and  CF,  Thus,  it  can  be  staled  that  El 

units  exhibited  a)  sharper  tuning  40  dB  abewe  threshold,  b)  less  responsiveness  to  broadband  transient 
stimuli,  c)  lower  minimum  thresholds,,  d)  lower  SRLs,  e)  stronger  nonmonotonicity,  0  narrower  dynamic 
range,  and  g)  preferred  slower  FM  speed  than  EE  neurons.  These  results  lead  us  to  speculate  that  the 
functional  distinctions  between  EE  and  El  neurons  suggests  the  existence  of  two  largely  independent 
processing  streams  in  the  auditory  cortex  that  may  be  distinguishable  by  the  pattern  of  their  thalamocortical 
inputs  (Middlcbrooks  and  Zook,  1983)  and  their  callosal  connections  (Imig  and  Brugge,  19  /8).  The  EE 
stream  appears  more  broadly  tuned  for  spectral  and  intensity  information  and  thus  may  emphasize 
integrative  processing,  wliiic  the  El  stream  seems  more  sharply  tuned  for  spectral  and  intensity 
information,  potentially  reflecting  greater  differentiation  of  input  signals. 

Most  response  properties  of  El  neurons  (including  the  fundamental  basis  of  their  nomenclature) 
seem  to  be  compatible  with  tlie  suggestion  that  these  tmits  have  a  slightly  stronger  inhibitory  influence  on 
the  shaping  of  the  overall  unit  response  profile.  It  is  unlikely,  however,  that  this  inhibition  is  provided 
through  inadvertent  (eg.,  through  crosstalk)  input  originating  from  the  ipsilaiural  inhibitory  pathway.  This 
is  based  on  the  fact  that  the  acoustic  crosstalk  between  the  two  cus  with  a  closed  sound  delivery  system  is 
approximately  -4()dB:  that  is,  the  signal  reaching  the  ipsilaterai  car  via  crosstalk  is  at  least  40  dB  less  than 
the  contialateral  signal.  Although  the  crossed  input  signal  may  contribute  to  the  response  profile  at  higher 
intensities,  it  is  unlikely  that  prcpertics  such  as  threshold  itself,  dynamic  range,  type  of  monotonicity,  and 
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sharpness  of  tuning  that  have  already  been  established  at  20  to  40  dB  above  threshold  are  affected  by  this 
crosstalk.  Thus,  it  is  probable  that  these  functional  di.Terences  reflect  genuine  distinctions  between  the 
processing  that  occurs  within  these  two  projection  pathways. 

Finally,  the  present  study  has  revealed  what  appears  to  be  a  difference  in  the  global  spatial 
distributioti  of  the  response  parameters  for  the  two  binaural  classes.  While  EE  neurons  expressed 
differences  in  the  response  parameters  between  the  dorsal,  central,  and/or  ventral  subregions  in  1 1  out  of 
30  possible  comparisons  (37%,  sec  Table  IV),  this  was  only  observed  in  6  out  of  30  cases  (20%)  for  E! 
neurons.  This  observation  suggests  that  the  spatial  differences  for  EE  neurons  are  more  strongly  expressed 
than  those  for  El  neurons.  However,  the  relatively  low  number  of  El  neurons  in  our  .sample  of  dorsal  AI 
may  have  conrri.butcd  to  a  lack  of  significance  in  a  number  of  possible  comparisons.  By  using  a  larger 
sample  size,  more  significant  differences  may  become  apparent  for  El  neurons  as  well. 

As  mentioned  above,  it  has  been  argued  (e.g.,  Merzcnich  ct  al.,  1982;  Middlebrooks  and  Zook, 
1983)  that  binaural  bands  in  the  high-frequency  domain  leflect  some  kind  of  specialization  of  different 
populations  of  neurons  with,  for  example,  emphasis  of  coding  of  spatial  information  in  El  locations  and 
emphasis  of  other  perceptually  important  aspects,  such  as  spectral  distribution,  in  EE  bands.  The  global 
differences  described  here  for  the  EE  and  El  sites  suppon  .such  a  view.  The  question  of  why  tlicre  are 
several  EE  and  El  aggregates  in  AI  may  relate  to  the  necessity  of  having  EE-  and  El-typc  processing 
carried  out  for  different  bandwidth,  intensity  and  temporal  conditions.  By  distributing  several  bands  across 
the  isofrequency  domain,  it  is  assured  that  regions  of  sharp  and  narrow  tuning,  regions  of  high  and  low 
thresholds  or  monotonicity  and  regions  of  rapidly  changing  frequency  (eg.  FM  sweep  speed)  ase  equally 
covered. 

Concluding  Remarks 

The  present  report  provides  a  comprehensive  description  of  the  spatial  distribution  of  the  final  two 
out  of  twelve  stimulus  response  properties  previously  reported  in  cat  AI  (Schreiner  and  Mcndclson,  1990; 
Schreiner  ei  al.,  1992;  Mcndclson  ci  al.,  1993).  The  results  obtained  from  these  experiments,  as  well  as 
those  from  other  investigators  (Jenkins  and  Merzenich,  1984;  Imig  et  al.,  1990;  Rajan  ct  al.  1990;  Hcil  ct 
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al..  1992;  Shamma  ct  a!.,  1993)  have  provided  us  with  a  greater  understanding  of  the  functional 
organization  of  primary  auditory  cortex.  In  general,  the  majority  of  response  properties  observed  in  our 
experiments  appear  to  be  organized  along  the  dorsoventral  axis  although  this  organization  does  not  appear 
to  be  very  strict  (Schreiner  and  Sutter,  1992,  Sutter  and  Schreiner  1995).  litis  suggests  that  there  may  be 
local  processing  within  a  given  isofrequcncy  band  for  at  least  some  oi  tltese  properties.  Suppon  for  this 
notion  of  local  processing  can  be  derived  from  tlic  investigations  into  the  intrintiic  cortical  connections  of 
AI  (Matsubara  and  Phillips,  1988;  Luethke  ct  al.,  1989;  Wallace  ct  aJ.,  1991).  Matsubara  and  Phillips 
(1988)  were  the  first  to  show  that  unlike  callossal  connections,  labelled  cells  were  found  in  a  variety  of 
binaural  response  types,  rcgai'dless  of  the  homogeneity  of  the  injection  site.  They  also  found  that  labelled 
cells  occurred  in  patches  of  the  same  or  higher  CFs.  Wallace  ct  ai.  (1991 )  found  that  while  labelled  cells 
were  often  located  within  an  isofrequcncy  band,  they  could  also  be  found  in  anterior  and  posterior 
locations.  These  results  suggest  that  local  processing  may  occur  within  an  isofrequcncy  contour  and  that 
this  processing  is  largely  but  not  completely  independent  of  binaural  intcr8.ction  response. 

The  presence  of  a  number  of  response  patameters  that  appear  to  be  systematically  organized  within 
AI  may  provide  a  set  of  criteria  by  which  the  dorsoventral  extent  of  AI  can  be  delineated.  With  the 
exception  of  binaural  response,  changes  in  the  distribution  of  other  response  features,  c.g„  onset  latency., 
arc  gradual  on  a  global  scale,  but  can  be  quite  varia'^’c  on  a  local  scale.  However,  they  may  not  necessarily 
provide  an  unequivocal  set  of  criteria  that  would  allow  for  a  clear  demarcation  of  the  borders  to  areas 
neighbouring  dorsal  and  ventral  AI. 

On  a  funcdonal  level,  the  systematic  distribution  of  these  (and  perhaps  other  as  yet  unknown) 
response  parameters  may  reflect  the  representational  basis  for  the  dctccdon  and  idcndficadon  of  specific 

features  of  sounds  arising  from  the  natural  environment.  Schreiner  and  Sutter  (1992)  have  recently 
suggested  that  the  region  in  Aid  (dorsal  to  the  peak  )  may  be  better  suited  for  integrative  analysis  of 

broadband  stimuli  by  responding  in  a  relatively  undiffcicndatcd  mBnncr  to  tones  or  spectral  peaks  of 
different  frequencies.  This  is  supported  by  the  observation  that  cells  in  this  subregion  tend  to  be  more 
broadly  tuned  and  to  have  a  high  responsiveness  to  broadband  stimuli  such  as  clicks.  In  contrast,  AIv  may 
be  better  suited  for  differential  analysis  of  the  spectral  properties  of  broadband  stimuli  since  units  thae  arc 
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characterized  by  sharp  tuning  which  is  distributed  over  varying  ranges  of  CFs  arid  which  do  not  seem  to 
respond  to  broadband  stimuli  as  well  as  cells  in  dorsal  AI. 
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Figure  Legends 


Fig,  lA-C.  The  spatial  discibution  of  CF,  integrated  excitatory  bandwidth,  expressed  as  2nd 

threshold  in  AI  for  case  #86-173.  Two  dimensions  of  the  pseudo  three-dimensional  projection  represent 
the  anteroposterior  (a,p)  and  dorsoventral  (d,v)  extent  of  the  conical  surface.  The  arrowheads  along  the 
anteroposterior  dimension  indicate  the  approximate  orientation  of  the  isofrequency  contours.  A  Spatial 
disoibution  of  CF,  7  he  elevation  of  the  surface  of  the  plot  corresponds  to  frequency.  As  illustrated,  CF 
increased  monotonically  as  the  electrode  was  advanced  in  a  ro.stral  to  caudal  direction.  The  sudden  increase 
of  CF  in  the  ventral  portion  of  the  map  indicates  the  transition  to  the  second  auditory  field  (All;  Schreiner 
and  Cynader,  1984).  B  The  spadal  distribution  of  QiOji£  The  elevation  of  the  surface  plot  corresponds  to 
the  magnitude  of  QjocIb  across  the  mapped  cortex.  Low  values  of  correspond  to  regions  where 

units  had  relatively  broad  integrated  excitatory  bandwidths  while  high  values  coircspond  to  a  region  where 
units  had  relatively  narrow  uandwidths.  The  scale  bar  for  the  axis  is  only  accurate  for  the  nearest 

corner  of  the  plot  because  of  the  distortion  in  the  projection.  As  can  be  seen,  QjodB  was  systematically 
distributed  across  the  cortical  surface  studied.  C  The  spatial  distribution  of  minimum  response  thresholds. 
Elevation  corresponds  to  the  magnitude  of  the  threshold.  ITiresholds  were  typically  lower  in  the  central 
region  of  the  map.  In  addition,  an  increase  of  threshold  along  the  cochleotopic  gradient  toward  higher  CFs 
was  also  observed.  All  data  depictea  in  the  pseudo  three-dimensional  plots  were  subject  to  a  smoothing 
factor  of  0.9  resulting  in  a  small  amount  of  additional  smoothing  (see  Schreiner  and  Mcndelson,  1990). 

Fig.  2A-D.  Representative  examples  of  multiple  unit  onset  latency  responses  as  a  function  of  stimulus 
intensity  level  re  minimum  threshold  (0  dB).  In  general,  onset  latencies  were  longer  at  lower  intensities. 

Fig.  3A-D,  Scatter  diagrams  of  on.set  latencies  as  a  function  of  characteristic  frequency  (CF).  In  general, 
onset  latency  appeared  to  be  independent  of  CF. 


Fig.  4.  Spatial  distribution  of  onset  latencies  obtained  at  30  -  40  dB  above  threshold.  The  upper  ponion 
depicts  a  pseudo  three-dimensional  projection  of  the  spatial  distribution  of  onset  latencies  in  AI  for  case 
#87-001.  As  with  the  pseude  three-dimensional  projection  of  integrated  excitatory  bandwidth,  two 
dimensions  of  the  projection  represent  the  anteroposterior  (a,p)  and  dorsoventral  (d,v)  extent  of  the  cortical 
surface.  The  arrowheads  along  the  anteropo.sterior  dimension  indicate  the  approximate  orientation  of  the 
isofrcquency  contour.'..  The  elevation  of  the  surface  of  the  plot  corresponds  to  the  onset  latency  (expressed 
in  ms)  across  the  mapped  cortex.  In  the  lower  portion  of  the  figure,  the  numbers  in  the  box  represent  the 
actual  onset  latency  values  that  underlie  the  pseudo  three-dimensional  spatial  reconstruction.  As  illustrated, 
onset  latencies  were  systematically  distributed  aaoss  the  cortical  surface  examined. 

Fig.  5A-C.  Additional  examples  of  the  spatial  distribution  of  onset  latencies.  Spatial  distribution  of  onset 
latency  for  A  case  #86-173,  B  case  #87-518  and  C  case  #87-706.  In  all  three  cases,  there  was  a 
nonrandom  spatial  distribution  of  unset  latencies.  Conventions  are  identical  to  those  in  Fig.  4. 

Fig.  6A,B.  Comparison  of  the  spatial  distribution  of  (A)  single  unit  (SU)  and  (B)  multiple  unit  (MU) 
onset  latencies  (determined  at  30  dB  above  threshold)  along  the  dorsoventral  axis  of  Al.  These  spatial 
distributions  are  depicted  in  relation  to  the  ntap  derived  for  integrated  excitatory  bandwidth  calculated  at  40 
dB  above  threshold  whereby  the  maximum  in  the  BW40  distribution  was  assigned  the  reference  value  of  0 
mm.  The  positions  of  recording  points  dorsal  (negative  values)  and  ventral  (positive  values)  are  shown 
relative  to  this  0  mm  position.  In  general,  while  the  overall  pattern  of  the  spatial  distribution  for  single  and 
multiple  unit  was  similar  onset  latencies  tended  to  be  shorter  for  multiple  units  than  for  single  units. 

Fig.  7A,B-  Regional  compari.son  of  the  mean  and  variance  of  onset  latencies  30dB  above  response 
threshold  for  (A)  single  units  (SU)  and  (B)  multiple  units  (MU). 


Fig.  8A,B.  Regional  comparison  of  the  minimum  latency  for  (A)  single  unit  (SU)  and  (B)  multiple  unit 
(MU)  responses.  Minimum  latency  corresponds  to  the  stimulus  condition  that  elicited  the  shoncst  latency 
independent  of  stimulus  intensity. 

Fig.  9A-E,  Spatial  distributions  of  binaural  responses  for  the  four  cases  studied.  A  Schematic 
representation  of  the  mapped  area  on  the  right  hemisphere  of  case  #86-173.  The  rectangle  in  the  inset 
approximates  tiie  mapped  area.  The  filled  cii'cles  in  the  enlarged  rectangle  represent  EE  cells,  open  circles 
represent  El  cells  and  filled  triangles  indicate  EO  cells.  B  Contour  plot  of  the  data  illustrated  in  A  with  the 
stippled  areas  representing  EE  or  EO  cells  and  the  white  areas  represent  locations  of  £1  cells.  The  clashed 
line  indicates  the  border  between  AI  and  All  as  detci'.nincd  by  the  integrated  excitatory  bandwidth 
measurements.  C-E  Contour  plots  of  binaural  responses  for  the  three  tenroining  coses. 

Fig,  10A,B.  Compaiison  of  the  spatial  distribution  of  onset  latency  and  binaural  responses  for  (A)  case 
#87-518  and  (B)  case  #87-001.  The  shaded  areas  for  the  binaural  response  plots  represent  EE  cells  while 
the  shaded  regions  for  the  latency  plots  indicate  locations  where  units  had  onset  latencies  longer  than  10.6 
ms.  The  dashed  line  represents  the  orientation  of  the  isofrcquency  axis  for  each  case.  In  general,  there  was 
no  covariance  of  onset  latency  and  binaural  response. 

Fig  11.  Distribution  of  Q-lOdB  values  along  the  dorso-vcntral  extent  of  AI  for  case  #86-173.  The 
transitions  from  the  central,  more  sharply  tuned  region  toward  me  dorsal  and  ventral,  less  sharply 
tuned  legions,  serve  as  boundaries  between  the  dorsal,  cennal  and  ventral  subregions  of  tlte 
isofrcquency  domain. 


Table  1;  Distribution  of  the  average  minimum  response  latency  for  binaural  interaction  classes. 


Case 

EE 

El 

#86-173 

14.1  ±2.7  (93) 

14.5  ±3.1  (64) 

#87-001 

12.2  ±2.2  (40) 

11.6  ±1.4  (46) 

#87-518 

10.9  ±1.5  (44) 

11.2  ±1.6  (31) 

#87-706 

12.4  ±2.3  (41) 

11.3  ±1.7  (31) 

EO 


I 

14.8  ±1.8  (35)  I 


Values  are  means  ±SD;  number  of  cortical  locations  given  in  parentheses. 

EE,  locations  with  excitatory/excitatory  binaural  inteiaction;  El,  locations  with  excitatory/inhibitory 
interaction;  EO,  locations  with  only  monaurally  evoked  responses.  Only  one  case  had  sufficient 
numbers  of  EO  responses. 


Table  2:  Coirelation  of  minimum  onset  latency  with  conical  response  measures  obtained  at  the  same 
locations. 


AI 


Ald^ 


AIv^ 


Case 

r 

P 

r 

P 

r 

P 

Q-lOdB 

#87-706 

-0.38 

0.0003 

-0.49 

0.0005 

#87-001 

-0.31 

0.002 

-0.39 

0.002 

Q-40dB 

#87-706 

#87-001 

-0.34 

0.0008 

-0.39 

0.002 

-0.31 

0.05 

Step 

#87-518 

#87-706 

-0.25 

0.03 

-0.50 

0.002 

#87-001 

-0.24 

0.02 

THR 

#87-706 

-0.30 

0.05 

MON 

#87-518 

-0.34 

0.005 

#87-706 

-0.46 

0.0001 

-0.48 

0.001 

-0.45 

0.006 

SRL 

#87-706 

-0.42 

0.0001 

-0.57 

0.0001 

-0.35 

0.03 

#87-001 

-0.37 

0.03 

DR 

#87-001 

-0.42 

0.02 

DS 

#87-518 

-0.29 

0.009 

-0.34 

0.02 

#87-706 

0.28 

0.05 

0.33 

0.03 

0.34 

0.04 

#87-001 

0.27 

0.007 

0.31 

0.02 

Linear  regression  analysis;  r  =  correlation  coefficient;  P  =  level  of  significance  (F-test). 

^Ald  repres'  nts  recording  location  in  the  portion  of  Al  dorsal  to  the  most  shaiply  tuned  multiple  unit 
responses  (Schreiner  and  Mendelson  1990). 

l^AIv  represents  location  ventral  to  the  most  sharply  tuned  region  of  AL 


THR,  threshold;  MON,  degree  of  monotonicity;  SRL,  strongest  response  level;  DR,  dynamic  range;  DS. 
direction  selective  response  to  FM  sweeps;  Step,  response  strength  to  a  fixquency  step  from  0.2  kHz  to  64 
kldz,  i.c.  broad-band  transient  (Schreiner  and  Mendelson  1990). 


Table  3:  Parameter  distributions  for  three  binaural  response  classes  (three  animals). 


N 

Q-lOdB 

Q-40dB 

Step  (%) 

EE 

129 

4.87  ±  3.05 

1.87  ±  1.02  -| 

64.1  ±  70.1 

El 

116 

5.19  ±  2.94 

2.38  ±  1.72  - 

43.1  ±  65.0 

EO 

15 

4.5  ±  1.97 

1.81  ±0.81 

61.2  ±  86.3 

N 

THR  (dB  SPL) 

SRL  (dB  SPL) 

MON  (%/dB) 

EE 

129 

20.5  ±  12.8  - 

44.9  ±  16.8  “ 

0.24  ±  0.69 

El 

116 

15.5  ±  11.1  : 

35.0  ±  15.8  : 

0.39  ±0.81 

EO 

15 

23.0  ±  15.3  - 

48  .5  ±  15.3  - 

0,26  ±  0,68 

N 

Latency  (ms) 

PS 

DS 

EE 

129 

11.8  ±  2.4 

2.20  ±0.24  - 

0.05  ±0.17 

El 

116 

11.4  ±  2.4 

2.09  ±0.24  - 

0.04  ±0.17 

EO 

15 

10.9  ±  1.4 

2.15  ±0.32 

0.01  ±0.17 

DR(dB) 

20.1  ±  in  - 

17.9  ±  6.9  - 
18.5  ±6.6 


Brackets  indicate  statistically  significant  differences  (ANOVA,  p<0.05)  between  binaural  classes. 
PS,  preferred  speed.  For  further  explanations  sec  Tables  1  and  2. 


Table  IV:  Regional  distribution  of  parameter  values  for  two  binaural  response  classes. 


Dorsal 

Central 

Ventral 

N 

N 

N 

Q-lOdB 

All 

83 

1 

4.39±  2.65 

95 

5.70  ±  2.88 

84 

1 

4.82  ±3.17 

EE 

51 

4.19  ±2.58 

44 

5.65  ±  3.06 

34 

4.86  ±3.52 

El 

29 

4,77  ±2.73 

45 

5.94  ±2.84 

43 

4.70  ±3.10 

Q-40dB 

All 

82 

1 

1.66  ±0.91 

91 

2.57  ±  1.71 

82 

2.01  ±1.20 

EE 

50 

1.61  ±0.96 

41 

2.4i±  I'.ll 

33 

1.54  1  0.64- 

El 

29 

1,78  ±0.87 

45 

2.75  ±2.16 

43 

2.39  ±1.55- 

Step 

All 

82 

55  ±75 

93 

57  ±69 

83 

51  ±64 

r  1 - 

“1 

I 

EE 

50 

44  ±63 

42 

76  ±75-1 

34 

78  ±  69  - 

El 

29 

66  ±85 

45 

42  ±64-1 

41 

29  ±  42  - 

THR 

All 

82 

20,7  ±  12.6 

95 

15.1  ±  11.3 

85 

20.1  ±12.3 

EE 

50 

1 

21.9  ±  13.9 

44 

16.4  ±11,9 

35 

23.9  ±11.1- 

£I 

29 

17.8  ±  iO.l 

45 

13.1  ±  10.4 

42 

16.6  ±  12,3- 

SRL 

All 

77 

*  1 

47.3  ±  15.2 

93 

1 

35.5  ±  16.3 

80 

i 

39.7  ±  17.6 

EE 

46 

49.1  ±  14.8 -| 

43 

39.1  ±  17.8-1 

31 

46.8  ±  16.4- 

El 

28 

41.9±  13.6-* 

45 

31.9±  14.5-* 

42 

33.6  ±  17.3- 

MON 

All 

76 

-0.18  ±0.74 

88 

-0.52  ±  0.74 

71 

-0.24 '±  0.77 

EE 

44 

-0.28  ±0.61 

39 

-0.37  ±  0.72 

25 

-0.01 '±  0.73' 

El 

29 

-0.06  ±  0.9 

44 

-0.63  ±  0.76 

39 

-0,36  ±0.71' 

DR 

All 

76 

1 

20.5  ±6.2 

89 

18.4  ±7.5 

71 

1 

17.9  ±8.1 

EE 

44 

21,0  ±6.6 

40 

19.4  ±8.1 

25 

19.8  ±9.0 

El 

29 

1 

20.2  ±  5.4 

44 

17.7  ±6.9 

39 

16.3  ±7.6 

Table  IV  (contLnued) 


Dorsal 

CenU'al 

Ventral 

N 

N 

N 

Latency 

All 

81 

11.8±2.1 

94 

11.2  ±2.4 

84 

11.6  ±2.5 

EE 

49 

12.2  ±2.3 

43 

11.4  ±2.7 

34 

11.6  ±1.9 

El 

29 

ll.3±  1.7 

45 

11.1±2.2 

43 

11.7  ±2.9 

PS 

All 

83 

I 

2.22  ±0.23 

95 

1  1 

2.04  ±  0.28 

84 

1 

2.21  ±0.28 

EE 

51 

2.21  ±  0.24 

44 

2. 14  ±0.23^ 

1 

1 

2.27  ±  0.24 

El 

29 

2.20  ±0.20 

45 

1.93  ±o!^ 

1  42 

2.18  ±0.30 

DS 

All 

83 

* 

0.01  ±  0.17 

95 

"1 

-0.09  ±0.16 

84 

1 

-0:04  ±0.15 

EE 

51 

1"  T" ' . . . 

0.01  ±0.17 

44 

. 1 

-0.09  ±0.16 

34 

""  1 

-0.08  ±0.15 

El 

29 

■0.02  ±0.17 

45 

-0.08  ±0.17 

42 

-0.02  ±0.15 

Brackets  indicate  differences  (ANOVA,  P<0.05)  between  regional  or  binaural 
parameter  distributions.  For  furtlicr  explanation  sec  Tables  1  and  2. 
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ABSTRACT 

Most  of  the  functional  aspects  of  the  primary  auditory  cortex  have  been 
systematically  explored  using  narrow-band  stimuli.  Naturally  occurring 
sounds  are  usually  spectrally  complex  and  not  easily  characterized  by 
narrow-band  functions.  In  this  study,  the  cortical  response  to  a  class  of 
spectrally  complex  broad-band  stimuli  is  systematically  explored.  The  stimulus 
is  a  harmonic  series  of  components  whose  spectral  envelope  is  sinusoidally 
modulated.  We  show  that  responses  of  neurons  in  the  primary  auditory  cortex 
are  influenced  by  a  number  of  spectral  envelope  parameters  including  the 
spacing  of  peaks  in  tlie  spectral  envelope,  the  phase  of  the  envelope,  and  its 
modulation  depth.  Neuronal  responses  are  also  Influenced  by  carrier 
properties  such  as  the  specific  spacings  of  the  harmonic  components,  the  total 
bandwidth  of  the  stimulus  and  the  overall  intensity.  There  is  little  apparent 
difference  between  the  spectral  envelope  of  transfer  functions,  phase 
response  profiles,  and  intensity  response  profiles  for  single  and  multiple 
units.  The  results  suggest  an  additional  approach  for  characterizing  responses 
of  neurons  to  broad-band  stimuli  that  are  modeled  after  a  behaviorally 
relevant  class  of  sounds.  Categorization  of  different  types  of  responses  to  these 
stimuli  is  possible  and  can  complement  and  expand  the  classical  description  of 
cortical  receptive  fields. 
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INl'K.ODUCnON 

The  scruccure  of  nauu’ally  occurring  sounds  is  frequently  temporally  and 
spectrally  complex.  Much  of  the  exploration  on  the  functional  organization  of  the 
auditory  cortex  in  mammals,  however,  has  relied  on  the  response  to  more  simple 
stimuli  such  as  pure  tones  (e.g.,  Merzenich  et  al.  1975;  Reale  and  Imig,  1980;  Phillips 
and  Irvine,  1981;  Phillips  et  al.,  1985;  Sciireiner  and  Mendelson,  1990;  Schreiner  et  al., 

1992) ,  and  more  parametrically  accessible  stimuli  sucli  as  amplitude  modiUated  tones 
or  noise  (e.g.,  Phillips  and  Hall,  1987;  Schreiner  and  Urbas,  1988;  Phillips  et  al.,  1989; 
Eggermont,  1993),  and  frequency  modulations  (e.g.,  Suga  and  Jen,  1976;  Mendelson 
and  Cynader,  1985;  Heil  et  al.,  1992b;  Mendelson  and  Grasse,  1992;  Mendelson  et  al. 

1993) .  Using  these  stimuli,  a  number  of  organizational  principler  of  auditory  cortical 
fields,  particularly  of  the  primarj'  auditory  field  (Al),  have  emerged  that  are  likely  to 
provide  the  basis  for  the  processing  of  the  more  complex  stimuli  in  the  acoustic 
biotope  of  each  animal.  Among  the  findings  were  the  discovery  of  a  systematic 
frequency  representation  in  Al  (e.g.,  Woolsey  and  Walzl,  1942;  Meizenich  et  al.,  1975), 
a  non-uniform  spatial  di.-)tribution  of  the  bandwidth  of  frequency  tuning  curves 
(Suga,  1965;  Schreiner  and  Mendelson,  1990;  Schreiner  and  Sutter,  1992;  Heil  et  al., 
1992b),  a  non-uniform  distribution  of  intensity  related  responses  (Suga,  1977: 
Asanuma  et  al.,  1983;  Phillips  et  al.,  1985;  Schreiner  et  al.  1992;  Heil  et  al.,  i992b),  a 
non-uniform  distribution  of  inhibitory  sideband  characteristics  (Shamma  et  al.,  1992; 
Sutter  et  al.,  1992),  preferential  responses  to  specific  frequency  sweeps  (Shamma  et 
al.,  1992;  Mendelson  et  al.,  1993),  regions  responsive  to  different  binaural  Interaction 
cues  (Imig  and  Adrian,  1977;  Middlebrooks  et  al.,  1980)  and  regions  sensitive  to 
different  spatial  cues  (Imig  et  al.,  1990,  Rajan  et  al.,  1990).  Each  of  these  studies 
directly  describes  receptive  field  properties  for  coding  specific  stimuli  features 
including  spectral,  temporal  and  spatial  characteristics.  Similar  observations  have 
been  made  in  the  equivalent  area  ,  Field  L,  of  birds  (Langner  et  al,  1981;  Hose  et  al., 
1987;  Heil  et  al.,  1992a). 

However,  the  relationships  between  these  basic  functional  organizations  of 
cortical  fields  and  the  neuronal  responses  to  natural  signals,  as  well  as  the  coding 
principles  of  complex  signals,  such  as  environmental  sounds  and  communication 
sounds,  have  not  been  well  established.  It  is  not  yet  evident  whether  the  receptive 
field  properties  described  for  the  coding  of  simple  spectral,  temporal,  and  sound 
localization  cues  are  suitable  and  sufficient  to  predict  the  neuronal  response  to  a 
large  variety  of  complex  signals.  This  is  largely  due  to  three  limitations.  First,  the 
description  of  coitical  coding  properties  has  concentrated  individually  on  spectral, 
temporal  and  sound  localization  cues.  For  each  of  these  cues,  a  number  of  descriptors 
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have  been  used  to  charaaerize  the  corresponding  receptive  field  aspects.  However,  it 
is  not  clear  which  of  the  currently  used  descripcors  are  necessary  for  a  complete 
characterization  of  tlie  receptive  field,  and  which  of  the  paiameters  provide 
redundant  or  limited  information.  For  example,  the  intensity  dependence  of  tonal 
response  is  usually  characterized  by  the  response  threshold,  the  dynamic  range,  the 
best  level,  and/or  an  estimate  of  the  moiiotonicity  of  the  rate/level  function. 

However,  even  a  combination  of  all  four  descriptors  is  not  sufficient  to  fully 
reconstruct  the  rate/level  function  and  make  predictions  of  the  firing  rate  for 
arbitrary  soun''  levels  (see  Raggio  and  Schreiner,  1994). 

Second,  inteiactions  between  the  three  main  aspects  of  auditory  receptive  fields, 
i.e.,  spectral,  temporal,  and  spatial,  have  been  insufficiently  addressed.  In  other 
words,  it  is  not  clear  whether  the  spectral,  temporal,  and  spatial  coding  mechanisms 
operate  independendy,  and  whether  they  can  be  characterized  independently  of 
each  other. 

And  third,  the  high  degree  of  temporal  and  spectral  complexity  of  many  natural 
or  artificial  signals  renders  the  systematic  definition  and  classification  of  relevant 
stimulus  dimensions  in  those  sounds  difficult.  Consequently,  stimulus  dimensions  to 
completely  and  appropriately  describe  receptive  fields  in  a  systematic  manner  have 
not  been  fully  identified. 

This  study  begins  to  explore  the  cortical  coding  of  a  particular  stimulus 
characteristic  of  broad-band  stimuli  -  the  spectral  envelope.  It  uses  several 
characteristics  of  the  spectral  envelope  that  potentially  carry  much  of  the  spectial 
information  including  spacing  between  peak  intensities,  depth  of  modulation, 
overall  intensity  and  bandwidth.  Many  auditory  stimuli  including  most  human 
vocalizations,  music  and  other  naturally  occurring  and  artificial  sounds,  are 
characterized  by  broad-band  spectra  with  distinctive,  non-uniform  spectral 
envelopes.  Spectral  envelopes  are  often  essential  for  classification  as  demonstrated 
by  the  distina  perceptual  differences  between  various  vowel  sounds.  Vowel  spectra 
are  characterized  by  several  frequency  regions  with  increased  spectral  energy 
(formants)  separated  by  regions  with  decreased  spectral  energy.  Different  vowels 
are  formed  by  altering  the  formant  spacing  and/or  position  (e.g.,  Dickson  and  Maue- 
Dickson,  1982).  Fig.  1  shows  a  spectral  envelope  for  a  human  vowel  sound  and  for  a 
cat  vocalization.  Both  vocalizations  are  marked  by  distinct  maxima  and  minima  in 
their  spectral  envelope.  Recent  evidence  from  studies  on  the  sharpness  of  timing 
(Schreiner  and  Mende’son,  1990,  Sutter  and  Schreiner,  1991;  Schreiner  and  Sutter, 
1992)  and  the  distribution  of  inhibitory  sidebands  (Shamma  et  al.,  1993)  suggest  that 
cortira)  neurons  are  able  to  distinguish  between  different  spectral  shapes. 
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One  approach  to  generalize  and  parameterize  this  stimulus  characteristic  is  the 
use  of  broad-band  spectra  that  have  sinusoidal  spectral  envelopes  with  regard  to 
logarithmic  frequency  and  intensity  scales.  The  benefits  of  'ripple  spectra'  as 
exploratory  stimuli  lie  a)  in  an  easy  characterization  of  the  spectral  envelope  that  is 
parametrically  accessible  and  resembles  features  of  naturally  occurring  complex 
signals,  b)  in  the  faa  that  the  stimulus  characterization  is  largely  independent  of  the 
spectral  content  and  the  location  of  the  'carrier'  signal,  c)  in  its  use  as  a  tool  for  the 
exploration  and  characterization  of  the  spectral  receptive  field  using  a  broad-band 
stimulus,  d)  in  its  potential  use  to  be  able  to  compare  these  characterizations  with 
those  obtained  with  traditional  methods  such  as  single-  and  two-tone  stimuli 
(Calhoun  et  al.,  1993;  Shamma  et  al.,  1994) ,  and  e)  in  its  potential  use  as  a  stimulus  to 
condua  a  system- theoretical  approach  to  the  central  auditory  system  that  is 
equivalent  to  the  spatial-frequency  analysis  used  in  the  visual  system  and  that  may 
lead  to  an  understanding  for  the  processing  of  arbitrary  spectral  envelope  wave 
forms  (Hillier,  1991). 

The  spectral  envelope  of  a  ripple  stimulus  shares  several  important 
characteristics  with  luminance  gratings  commonly  used  to  study  processing  in  the 
visual  system.  Both  stimuli  provide  similar  distributed  excitations  across  the  receptor 
surface:  in  the  visual  system,  the  magnitude  of  the  stimulus  varies  sinusoidally  across 
the  retina  while  in  the  auditory  system,  the  magnitude  varies  sinusoidally  along  the 
basilar  membrane.  Application  of  this  method  in  the  visual  system  has  lead  to 
classificadons  of  neurons  by  their  response  to  sinusoidal  gratings.  Preferences  to 
spatial  characteristics  of  the  stimulus,  such  as  the  grating  frequency,  modulation, 
and  orientation,  revealed  important  properties  of  cenU'al  visual  processing  and 
provided  a  way  of  characterizing  those  properties. 

In  the  first  part  of  this  study,  we  concentrate  on  characterizing  the 
activity  of  cortical  neurons  and  groups  of  neurons  in  the  primary  auditory 
cortex  of  the  cat  in  response  to  ripple  spectra  for  a  range  of  ripple  parameters 
such  as  spearal  envelope  modulation  frequency,  phase,  and  modulation  depth. 

The  relationships  of  ripple  spectrum  responses  to  properties  of  spectral 
receptive  fields  as  obtained  with  pure  tones  and  the  system-theoretically  based 
derivation  of  spectral  receptive  fields  from  ripple  transfer  functions  will  be 
shown  in  forthcoming  papers. 

METHODS 

Surgery  and  Animal  Preparation 
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Data  were  collected  from  22  healthy  adult  cats.  Federal  and  institutional 
gmdelin''  s  were  followed  in  the  care  and  use  of  the  cats.  The  cats  were  pre¬ 
anesthetized  using  a  mixture  of  ketamine  hydrochloride  (10  mg/kg)  and 
acepromazine  maleate  (0.25  mg/kg).  They  were  given  dexamethasone  sodium 
phosphate  (0.25  mg/kg/24  h)  to  control  brain  edema,  and  atropine  sulfate  (0.25 
mg/12  h)  to  control  mucous  production.  The  femoral  vein  was  exposed,  and  a  venous 
cannuladon  was  “'erformed.  After  an  initial  dose  of  sodium  pentobarbital  (to  effect, 
~30mg/kg),  an  a.  lexic,  hydrated  state  was  maintained  through  constant  infusion  of 
an  8:1  mixture  of  lactated  ringer's  solution  and  sodium  pentobarbital  (-4  ml/h)  with 
supplemental  intravenous  injections  of  sodium  pentobarbital  as  needed.  A  tracheal 
cannula  was  inserted.  A  rectal  temperature  probe  was  used  to  record  the  temperature 
of  the  animal  and  maintain  it  at  37.5  C  using  a  feedback  controlled  heated  water 
blanket.  The  ECG  and  respiration  rate  was  remotely  monitored  throughout  the 
experiment. 

The  head  of  the  cat  was  fixed  with  a  mouth  bar  leaving  the  external  meat! 
unobstructed.  The  temporal  muscle  over  the  right  hemisphere  was  then  retracted.  A 
cianiotomy  was  used  to  expose  the  lateral  cortex  above  the  ectosylvian  sulcus. 

Finally,  the  primary  auditory  cortex  was  exposed  by  incising  and  reflecting  the  dura. 
The  cortex  was  covered  with  silicon  oil.  For  recording  single  units,  the  cortex  was 
usually  covered  with  a  1.5%  soludon  of  clear  agarose  in  saline  which  diminished 
conical  pulsations  while  providing  a  fairly  unobstructed  view  of  the  cortical  surface. 

Hollow  ear  bars  were  insened  into  the  ear  canals  to  deliver  the  stimuli  and  a 
micromanipulator  was  positioned  so  that  an  electrode  could  be  insened 
perpendicular  to  the  ■■  'n  -  e  of  the  cortex. 

Electrophysic'l  ugy 

Paryle  :  '•ieu  ^sten  electrodes  (Microprobe)  with  an  impedance  of  0.8  - 
1.2  MOhms  at  i  'iz  wei  inserted  into  the  auditory  cortex.  A  differential  amplifier 
(World  Precision  Instruments  DAM70-E)  filtered  the  activity  below  1  kHz  and  above 
10  kHz.  A  window  discriminator  (BAK  DIS-1)  permitted  acceptable  threshold  levels 
and  wave  form  criteria  to  be  set  to  distinguish  action  potentials  from  the  background 
signal.  In  this  manner,  responses  of  a  single  unit,  or  of  multiple  units,  could  be 
selected  and  recorded.  Activity  of  an  acceptable  size  and  shape  resulted  in  a  trigger 
pulse  that  was  sent  to  a  computer  (DEC  11/73)  that  recorded  the  number  and  time  of 
the  pulses  to  predetermined  stimuli  for  later  analysis. 


Acoustic  Stimuli 
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Experiments  were  conducted  in  a  double-walled  sound-sliielded  room  (lAC). 
Auditory  stimuli  were  generated  with  a  stand-alone  digital  signal  processor  (DSP 
TlvlSSEOlO)  with  a  16-bit  de-glitched  DAC  at  a  sampling  rate  of  60  or  120  kllz.  The 
stimulus  was  low-pass  filtered  96  dB/oct  at  15  or  50  kllz,  respectively.  The  amplitude 
of  tonal  stimuli  was  controlled  by  the  number  of  amplitude  steps  in  the  generating 
wave  foriTi.  Each  step  corresponded  to  ~0.1S  mV  resulting  in  a  dynamic  range  of  70 
dB.  To  ensure  a  good  signal-to-noise  ratio  for  the  complex  sounds,  the  full  amplitude 
was  used  for  those  conditions.  Passive  attenuators  (Hewlett  Packard  3S0D)  provided 
additional  variable  attenuation.  The  system  was  designed  to  provide  a  fairly  flat 
transfer  function  when  connected  to  the  average  cat  ear  (Sokolich  1981,  US  Patent 
42S1686).  Headphones  (STAX  54)  were  enclosed  in  small  chambers  and  connected  to 
sound  delivery  tubes.  The  tubes  were  inserted  into  the  acoustic  meati.  Two  different 
types  of  stimuli  were  used:  tonal  stimuli  and  ripple  stimuli.  The  tonal  stimuli  were 
used  to  determine  the  frequency  response  areas,  while  the  ripple  stimuli  were  used  to 
determine  the  transfer  functions  and  response  proillcs  to  spectrally  complex  signals. 

Frequency  Response  Area 

Using  a  manually  controlled  frequency  generator  (General  Radio;  1309-A 
Oscillator)  to  search  for  a  response,  neurons  between  600ii  and  1200;i  were  identified. 
Upon  finding  a  responsive  single  neuron  or  neuron  cluster,  the  first  step  was  to 
determine  the  frequency  response  area  (FRA).  After  making  an  Initial  estimation  of 
the  characteristic  frequency  (CF)  and  bandwidth  of  the  receptive  field  by  manually 
varying  the  frequency  and  intensity  and  using  audiovisual  response  criteria,  the 
FUA  was  obtained  by  pseudorandomly  presenting  stimuli  from  15  intensity  levels  and 
45  frequencies.  The  levels  were  steps  of  5  dB,  giving  a  sampled  dynamic  range  of  70 
dB.  The  frequency  range  was  centered  around  the  manually  determined  CF  of  the 
recording  site,  and  covered  between  3  and  5  octaves  depending  upon  the  estimated 
bandwidth.  The  frequencies  were  spaced  in  vequal  fractions  of  an  octave  over  the 
entire  range.  Each  stimulus  was  presented  for  50  ms  with  a  3  ms  rise  time,  and  400  ms 
inter  stimulus  interval. 

Ripple  Spectra 

Once  the  FRA  was  determined,  the  stimulus  was  changed  from  pure  tones  to 
broad-band  stimuli  with  distinct  spectral  envelopes.  These  stin*  ’li  will  be  referred  to 
here  as  "ripple  stimuli".  The  ripple  stimulus  is  genei  ated  by  sinusoidally  modulating 
the  spectral  magnitude  of  a  carrier  along  a  logarithmic  frequency  scale.  iOO  to  256 
harmonic  signals  were  used  as  carriers.  The  fundamental  frequency  of  these 
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harmonic  components  could  be  varied  so  that  they  usually  ranged  from  SO  to  200  Hz 
and  the  maximum  number  of  components  was  below  256  (the  maximum  number  that 
could  be  produced  by  the  DSP).  The  phase  of  the  indi\idual  frequency  components 
was  fixed  such  that  eacli  component  was  phase  shifted  53*  farther  than  the  previous 
one.  This  shifting  eliminated  strong  peakiness  in  the  temporal  wave  form  that  would 
have  resulted  with  superposition  of  the  components  in  cosine  or  sine  phase.  When 
plotting  the  spectrum  on  a  double  logarithmic  scale,  the  spectral  envelope  of  the 
stimulus  was  sinusoidal.  Its  bandwidth  was  set  to  3  octaves  with  the  geometric  center 
of  the  band  at  a  spectral  maximum  that  usually  was  selected  to  correspond  to  the  CF  of 
tlie  neuron.  Since  the  individual  components  were  linearly  spaced,  a  6  dB/oaave 
decrease  in  the  envelope  of  the  ripple  maintained  a  constant  energy  level  per  octave. 
The  inverse  wavelength  of  the  sinusoidal  spectral  envelope  is  referred  to  as  ripple 
density  and  is  expressed  in  ripples/octavc;  the  modulation  of  the  spectral  envelope  is 
sinusoidal  on  a  logarithmic  scale  and  the  standard  spectral  modulation  depth  (ripple 
depth)  was  30  dB.  The  phase  of  the  spectral  envelope  (ripple  phase)  is  defined  as  0* 
when  the  center  peak  of  the  spectral  envelope  at  the  geometric  mean  of  the  stimulus 
is  aligned  with  the  CF  of  the  recording  site.  TTic  overall  intensity  of  the  stimulus  is 
expressed  in  dB  SPL  (as  measured  on  the  linear  setting  of  a  6  &  K  sound  level  meter  at 
the  end  of  the  car  bars).  The  stimulus  had  a  duration  of  100  ms  including  S  ms  rise 
and  fall  times.  The  Interstimulus  Interval  was  usually  700  ms,  and  was  increased  to  1- 
l.Ss  if  obvious  adaptation  effects  were  noted. 

A  scliematic  depiction  of  the  standard  stimulus,  with  a  ripple  density  of  1 
ripple/octave  and  a  modulation  depth  of  30dB,  is  shown  in  Fig.  2A.  Starting  with  the 
standai’d  stimulus,  individual  parameters  could  be  varied.  For  example,  the  ripple 
density  can  be  changed  (Fig.  2B)  and  the  modulation  depth  can  be  varied  (Fig.  2C).  In 
addition,  tlie  overall  intensity  can  be  clianged  by  increasing  or  decreasing  each 
component  by  the  same  amount,  and  the  fundamental  frequency  as  well  as  the 
spectral  width  can  be  varied.  As  the  ripple  phase  is  varied,  the  central  peak  In  the 
spectrum  will  move  off  the  CF  until  a  trough  is  aiigueu  Witll  die  CF  (Fig.  3), 
co.rrespondlng  to  a  180“  phase  shift.  Note  that  envelope  phase  variations  do  not  alter 
die  frequency  or  phase  propeixies  of  the  carrier  components,  i.e.,  the  location  of  the 
carrier  spectrum  is  not  shifted.  The  amount  of  frequency  shift  Af  (In  oaaves)  of  the 
apparent  spectral  peaks  or  troughs  Is  determined  by  the  ripple  phase  shift  Ad)  and 

Che  ripple  density  (KD): 

Af  -  (1  /RD)  *  (-A<D/360). 


Ripple  2'ransfcr  J'luicUon 
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Ripple  transfer  functions  (RTFs)  express  the  firing  rate  in  response  to  a  ripple 
spectrum  as  a  function  of  ripple  density.  To  find  an  appropriate  intensity  for  the 
stimulus,  a  ripple  stimulus  was  presented  with  a  modulation  depth  of  30  dB  and  a 
ripple  density  of  1  ripple/octave  and  the  overall  intensity  was  varied  until  the  level 
evoking  the  strongest  response  was  determined  using  an  audiovisual  measure  of 
response.  This  best  intensity  was  used  to  obtain  the  ripple  transfer  functions. 

Data  were  collected  for  an  ordered  sequence  of  ripple  densities  (15  values;  0-6 
rip/oct;  0.2  to  1  rip/oct  spacing)  or  a  pseudorandom  sequence  (14  or  20  values;  0  -  8.66 
rip/oct;  0.33  to  0.66  rip/oct  spacing).  Each  stimulus  condition  was  presented  25  times. 
In  order  to  minimize  adaptation  to  a  particular  stimulus,  the  inter  stimulus  interval 
was  at  least  750  ms,  and  occasionally  over  1  second.  In  most  cases,  this  long  inter 
stimulus  interval  allowed  the  responses  to  the  stimuli  to  be  equally  strong  at  tlie  end 
of  the  25  presentations  as  at  the  beginning. 

The  data  were  collected  in  post  stimulus  time  histograms  (PSTHs)  allowing  the 
number  of  action  potentials  and  latency  of  each  response  to  be  determined.  In  order 
to  create  the  ripple  transfer  functions  from  the  PS’^s,  the  number  of  responses  in 
each  PSTH  are  determined.  Since  tonic  responses  were  not  seen,  we  only  consider  the 
number  of  spikes  in  a  30ms  window  that  begins  with  the  first  spike  after  the  onset  of 
the  stimulus. 

Once  the  standard  RTF  was  determined,  response  profiles  were  created 
for  several  other  stimulus  parameters.  Response  profiles  plot  the  strength  of 
the  neuronal  response  as  a  function  of  a  specific  parameter  and  were  obtained 
for  the  ripple  modulation  depth,  the  phase  of  the  spectral  envelope,  and  the 
overall  intensity  of  the  stimulus.  In  addition,  effcas  on  responses  by  the 
fundamental  frequency  and  the  bandwidth  of  the  stimulus  were  investigated. 

In  each  case,  PSn-Is  were  obtained  for  25  presentations  of  the  stimulus.  The 
PSTIls  were  analyzed  and  the  response  profiles  plotted  in  the  same  manner  as 
for  the  RTFs. 

Analysis 

From  the  RTFs  and,  if  appropriate,  the  response  profiles,  trie  following 
properties  were  determined;  the  parameter  values  for  the  strongest  or  best 
response,  the  maximum  and  minimum  spike  response,  the  bandwidth  of  the 
response  and  the  shape  of  the  response.  Based  upon  their  shape,  RTFs  were 
categorized  by  the  following  filter  types:  band  pass,  low  pass,  high  pass,  notch 
and  miscellaneous.  To  be  classified,  stimulus  specific  response  modulations 
had  to  show  at  least  a  30%  difference  from  the  best  response.  Intensity 
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response  profiles  were  classified  as  monotonic  or  nonmonotonic  according  to 
the  growth  of  the  response  at  high  intensities.  Additionally,  a  modulation 
index  of  the  RTFs  and  response  profiles  was  determined  by  dividing  the 
difference  between  the  maximum  and  minimum  response  by  the  maximum 
response.  This  index  varies  between  0  and  1,  and  reflects  the  degree  of 
response  modulation  due  to  changes  in  the  varied  parameter.  The  modulation 
index  was  determined  for  RTFs  (Ripple  Modulation  Index,  RMI)  and  for  the 
phase  profiles  (Phase  Modulation  Index,  PMI).  The  best  response  of  a  RTF  or  a 
response  profile  was  the  value  of  the  variable  parameter  that  elicited  the 
strongest  response.  If  two  neighboring  parameter  values  elicited  identical 
responses,  their  values  were  averaged.  The  maximum  response  was  the  largest 
number  of  spikes  that  occurred  following  a  stimulus  onset.  The  minimum 
response  was  the  average  of  the  three  parameter  values  that  elicited  the  lowest 
number  of  spikes  following  stimulus  onset.  The  bandwidth  of  a  RTF  is  defmed 
as  the  range  of  ripple  densities  over  which  the  response  is  larger  than  the 
50%  value  between  maximum  and  minimum  response. 

Where  possible,  analyses  were  carried  out  individually  for  single  unit 
and  multiple  unit  recording  sites  and  compared  to  each  other.  First,  the 
variance  and  the  mean  of  the  two  populations  were  determined.  Then  a 
variance  ratio  test  was  used  to  determine  whether  the  variances  of  the  two 
populations  were  the  same,  and  a  t-test  was  used  to  compare  the  means  of  the 
two  populations.  To  increase  the  power  of  the  test,  and  reduce  the  possibility 
of  false  negatives,  a  -  0.1  was  used  for  the  test  value.  To  compare  the 
frequency  distribution  of  filter  classifications,  a  contingency  table  was  set  up 
and  the  chi-squared  test  used. 

In  addition  to  comparing  single  and  multiple  units,  statistical  analysis 
was  used  to  compare  the  positive  and  negative  phase  symmetry  indices  by 
taking  the  absolute  value  of  both  indices,  and  comparing  their  variances  and 


means.  Again,  the  a- value  was  set  to  0.1  to  increase  the  power  of  the 


comparison. 


RESULTS 

Data  were  collected  from  201  nmltiple  units  and  77  single  units  recorded  in  the 
primary  auditory  cortex  (AI)  of  22  adult  cats.  The  first  several  multiple  unit 
recordings  we*'e  used  to  identify  AI.  Isofrequency  contours  identified  the  rostral  and 
caudal  boundaries  of  AI,  while  the  sharpness  of  tuning  (Schremer  and  Mendelson, 
1990)  and  threshold  identified  the  ventral  and  dorsal  regions  (Schreiner  and 
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Cynader,  1984;  Schreiner  et  al.,  1992).  The  recording  sites  spanned  characteristic 
frequencies  of  1  -  20  kHz,  with  the  majority  being  in  the  3-8  kHz  range.  Since  most 
of  the  recordings  were  taken  from  the  central  two-thirds  of  Al,  the  rostral/caudal 
boundaries  did  not  have  to  be  precisely  located.  However,  we  did  try  to  obtain 
responses  from  all  regions  of  the  isofrequency  domain.  The  wide  range  of  sharpness 
of  tuning,  with  Q,-10dB  and  Q.-40dB  values  ranging  from  0.5  to  10,  indicates  that  a 
fairly  diverse  region,  dorsal  to  ventral,  was  included  in  our  sampling  (Schreiner  and 
Mendelson,  1990;  Schreiner  and  Sutter,  1992). 

Ripple  Transfer  Functions 

Approximately  5%  of  the  neurons  identified  in  Al  with  pure  tone  stimuli  did 
not  respond  to  any  of  the  ripple  stimuli.  All  of  these  neurons  were  located  in  a 
narrow  central  region  of  Al,  were  extremely  sharply  tuned,  and  had  highly 
nonmonotonic  rate/level  functions  for  pure  tones.  For  each  of  the  remaining  95%  of 
the  recording  locations,  a  ripple  transfer  function  (RTF)  was  obtained  at  a  ripple 
phase  of  0°  and  a  spectral  envelope  modulation  depth  of  30dB.  Figure  4A  illustrates 
the  construction  of  an  RTF  for  a  single  neuron.  The  PSTHs  for  ten  different  ripple 
densities  are  shown.  Each  PSTH  has  been  labeled  with  the  ripple  density  of  the 
stimulus  and  the  number  of  action  potentials  elicited.  For  ease  in  comparison,  the 
PSTHs  have  been  placed  in  ascending  order  of  ripple  density  and  not  in  the  pseudo 
random  order  in  which  they  were  presented  to  the  animal.  The  ripple  transfer 
function  plots  the  number  of  spikes  on  the  ordinate  vs.  the  ripple  density  on  the 
absdssa  (Fig.  4B).  The  filled  points  represent  one  set  of  ten  stimuli,  tlij  open  points 
represent  a  second,  complementary  set  of  ten  stimuli.  The  line  represents  a  two- 
point  average  of  the  full  data  set.  Several  common  properties  of  RTFs  can  be 
illustrated  with  this  example  including  a  best  ripple  density,  the  filter  shape,  tlie 
bandwidth  of  the  response,  and  the  modulation  index.  Commonly,  the  RTF  reveals  a 
range  of  ripple  densities  for  which  the  response  of  the  neuron  is  opdmal.  The  best 
ripple  density  (BRD,  the  ripple  density  that  produced  the  strongest  response),  is  used 
as  one  of  the  descriptors  of  RTFs.  In  this  case,  the  neurons  have  a  best  ripple  density 
of  1  ripple/octave.  The  shape  of  the  RTF  in  Fig.  4b  can  be  classified  as  band  pass 
since  the  responses  to  ripple  densities  below  and  above  the  best  ripple  densities  are 
at  least  30%  smaller  than  at  the  best  ripple  density. 

Another  descriptor  is  found  by  determining  the  points  below  and  above  the 
peak  where  the  response  is  half  way  between  its  maximum  and  minimum.  ITie 
distance  between  these  points,  expressed  in  ripples/octave,  is  the  bandwidth  of  the 
RTF  at  50%.  In  this  case,  the  bandwidth  at  50%  is  1.1  ripples/octave. 
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Finally,  the  ripple  modulation  index  (RMI  -  (max.  response  -  min.  response)  / 
max.  response)  was  determined.  In  this  example,  RMI  -  (34  -  2)/(34)  -  0.91.  Figure  4c 
summarizes  the  various  descriptors  for  a  the  RTF  of  a  different  neuron. 

Figure  5  shows  twelve  representative  ripple  transfer  functions  constructed 
from  14-20  different  ripple  stimuli  ranging  from  0  to  8.6  ripples/octave.  The 
transfer  functions  with  two  distinct  symbols  (Figures  5a,  g,  and  i)  are  made  of  two 
pseudorandom  sets  of  stimuli  presented  with  a  IS  second  break  between  sets.  The 
others  are  from  stimuli  presented  in  order  from  low  to  high  ripple  density.  The  six 
examples  on  the  left  (a-f)  are  typical  RTFs  showing  band-pass  filter  characteristics. 

For  tliese  representative  examples,  the  best  ripple  densities  range  from  0.7  to  5.0 
ripples/octave.  The  six  examples  on  the  right  (g-1)  show  representative  examples  of 
RTFs  that  are  not  band-pass  filters.  A  notch  filter  is  shown  in  Fig.  5g‘.  ripple  densities 
between  1  and  4  ripples/oaave  elicit  almost  no  response  while  the  responses  to  the 
highest  ripple  densities  are  within  50%  of  the  maximum  response.  A  low  pass  filter 
characterisitic  is  illustrated  in  Fig.  Sh.  In  this  particular  case,  the  lowest  ripple 
density  presented  was  0.3  lipples/octave.  If  lower  ripple  densities  had  been 
presented,  ther  e  is  a  possibility  this  neuron  would  have  turned  out  to  be  a  band-pass 
filter.  Double  band-pass  filters  occur  when  there  are  two  distinct  peaks  in  the  ripple 
transfer  function  with  the  secondary  one  at  least  50%  as  high  as  the  primary  one  (as 
seen  in  Figure  51).  For  tiie  purpose  of  this  paper,  there  will  not  be  a  distinction  made 
between  double  band-pass  and  band-pass  aside  from  noting  theh*  existence.  The  RTF 
in  Fig.  Sj  shows  the  characteristics  of  a  band  pass  filter.  Since  the  second  peak  is  not 
within  50%  of  die  first,  it  was  not  regarded  as  a  distinct  maximum.  Fhially,  a  high 
pass  and  an  all-pass  filter  are  shown  in  Fig.  5k  and  51,  respectively.  Obviously,  in 
determining  the  type  of  filter,  there  is  always  some  overlap,  and  approximation.  For 
instance,  unless  a  ripple  den.sity  of  0  ripples/octave  was  presented,  a  low  pass  filter 
could  actually  be  inisclassifled  as  a  band-pass  filter.  If  a  filter  did  not  have  a  clear 
classification,  either  because  the  data  were  too  noisy,  or  because  it  overlapped  with 
other  categories  it  was  considered  'miscellaneous'.  All-pass  filters  (Fig.  51)  were  also 
classified  as  miscellaneous.  The  majority  of  neurons  (48%)  were  band-pass,  while 
18%  were  low  pass,  6%  notch  filters,  3%  high  pass,  and  25%  'miscellaneous'  (Figure 
6a). 

No  significant  difference  was  found  between  the  characteristics  of  RTFs  of  the 
single  units  (examples  in  Figs.  5b,  c,  d,  c,  g,  h,  and  j)  and  multiple  units  (examples  in 
Figs.  5a,  f,  i,  k.  and  1).  That  there  was  little  or  no  difference  in  the  descriptors  of 
single  and  multiple  units  can  be  seen  in  the  summaiy  histograms  (Figure  6a:  filter 
shapes;  Figure  6b:  best  ripple  densities).  The  majority  of  units,  both  single  and 
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multiple,  were  band  pass.  Their  best  ripple  density  ranged  from  less  than  1 
ripple/ octave  to  over  4  ripples/octave  with  a  mean  of  1.11  +/-  0.86  for  single  units 
and  1.25  +/-  1.16  for  multiple  units.  To  a  significance  level  of  p  -  0.10,  no  difference 
was  found  for  the  two  populations  between  the  variances,  the  means,  or  the  filtering 
characteristics  (sec  Table  1). 

The  bandwidth  of  band  pass  RTFs  can  also  be  used  to  classify  RTFs.  A  wide 
variety  of  bandwidths  of  RTFs  was  observed  ranging  from  less  than  1  ripple/octave 
(e.g..  Fig.  Sa)  to  over  3  ripples/octavc  (e.g..  Figs.  5e  and  f).  A  frequency  distribution 
for  the  bandwidth  of  the  RTFs  (Fig.  6c)  shows  tliat  the  bandwidth  was  usually  around 
0.5  - 1.0  ripples/octave  although  it  was  not  unusual  for  the  bandwidth  to  reach  3.0 
ripples/octave.  Again,  the  mean  and  variance  for  both  single  and  multiple  units 
were  determined  (see  Table  1)  and  compared.  To  a  significance  level  of  p  -  0.10,  no 
significant  difference  was  found. 

The  modulation  index  of  the  RTFs  ranged  from  RMI  -  1  (l.e.,  RTFs  reflected  a 
clear  response  to  some  ripple  densities  and  no  response  to  other  ripple  densities)  to 
nearly  zero  (i.e.,  hardly  any  difference  was  seen  between  the  responses  to  different 
ripple  densities).  Neurons  that  did  not  respond  to  ripple  stimuli  at  all  were  not 
included.  Figure  Sc  shows  an  example  with  a  large  modulation  index  (RMI  «  1) 
whereas  the  RTF  in  Fig.  51  has  a  small  modulation  index  of  RMI  -  0.36.  Accordingly, 
there  is  a  small  difference  between  the  maximum  and  minimum  responses  in  tlie 
first  case,  and  a  large  difference  in  the  second  case.  The  majority  of  neurons  and 
neuron  clusters  had  RTF  modulation  Indices  above  0.5.  Mean  values  over  all 
neurons  were  RMI  -  0.81  +/-  0.16  for  single  units,  and  RMI  »  0.71+/-  0.21  for  multiple 
units  (Fig.  6d),  i.e.,  single  unit  RlTs  showed  a  slightly  higher  modulation  of  response 
strength  chan  multiple  unit  RTFs.  The  variances  between  the  two  populations,  for  p  - 
0.1,  showed  a  significant  difference  indicating  that  the  populations  were  different. 
However,  there  was  no  significant  difference  between  the  means  (see  Table  1). 

In  summary,  the  systematic  investigation  of  the  influence  of  a  range  of 
spectral  envelope  frequencies  on  cortical  responses  to  broad-band  stimuli  revealed 
that  a)  the  majority  of  cortical  neurons  are  tuned  to  a  specific  range  of  spectral 
envelope  frequencies;  b)  about  half  envelope  transfer  functions  have  band-pass 
chai  acteristics;  the  remaining  RTFs  are  low  pass,  high  pass,  notch,  or  of 
miscellaneous  type;  c)  best  ripple  densities  range  from  0.3  to  5  ripples/octave  with  a 
mean  around  1.2  ripples/octave;  d)  the  sharpness  of  tuning  to  ripple  densities  varies 
between  0.3  ind  3  ripples/octave  with  an  average  bandwidth  of  1.3  ripples/octave;  e) 
the  degree  of  modulation  of  the  response  due  to  changes  in  ripple  density  of  the 
stimulus  can  vary  from  nearly  0  to  100%  with  a  mean  of  approximately  75%. 
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Phase  Respoase  Profiles 

In  the  construction  of  RTFs,  the  stimulus  was  always  set  so  that  a  spectral 
maximum,  located  at  the  geometric  center  of  the  stimulus,  coincided  with  the  CF  of 
the  neuron  or  neuron  duster  under  study.  This  phase  constancy  of  the  spectral 
envelope  of  the  stimulus  relative  to  the  receptive  field  of  the  neuron(s)  provides  an 
important  constraint  for  the  interpretation  of  the  described  response  dependencies. 

It  is  apparent  that  other  ripple  phase  choices,  such  as  +180'  or  -180*,  corresponding 
to  a  spectral  minimum  at  the  CF  of  the  neuron,  could  result  in  quite  different 
neuronal  responses.  In  t  his  section,  the  influence  of  tlie  spectral  envelope  phase  on 
the  response  to  a  fbccd  ripple  density  is  evaluated.  A  phase  shift  of  the  spectral 
envelope  corresponds  to  a  frequency  shift  of  the  ripple  maxima.  A  positive  phase 
shift  moves  all  peaks  and  troughs  to  lower  frequencies  while  a  negative  phase  shift 
will  result  in  the  movement  of  the  peaks  and  troughs  to  higher  frequencies.  The 
frequency  content  of  the  carrier  spectrum  does  not  dunge  with  these  phase  shifts. 

Figure  7  shows  the  phase  response  profile,  the  spike  count  vs.  the  ripple  phase 
of  the  stimulus,  for  six  different  recording  sites.  In  each  case,  a  phase  shift  of  0’ 
corresponds  to  a  maximum  aligned  with  the  CF,  and  a  phase  shift  of  +180*  or  -180' 
corresponds  with  a  minimum  aligned  with  the  CF  of  the  recording  site.  The  ripple 
densities  that  were  used  to  determine  tlie  phase  response  profiles  corresponded  to 
either  the  best  ripple  density  of  the  unit,  or  were  approximately  1  ripple/octave 
since  that  was  the  average  best  ripple  density.  The  actual  ripple  density  used  is  stated 
in  tile  legend.  It  is  apparent  that  the  best  phase  or  the  phase  producing  the  strongest 
response  in  each  example  is  at,  or  near,  zero.  Slight  deviations  from  0“  of  the 
maximum  of  several  phase  profiles  can  be  partially  attributed  to  small  differences  In 
tile  value  of  the  CF  at  die  threshold  of  the  frequency  response  area  and  the  best 
frequency  at  the  actual  intensity  of  the  spectral  peak. 

in  a  linear  system,  the  overall  shape  of  Che  pha:;e  profiles  would  be  cosine 
funedons,  indicated  by  the  dashed  lines  in  Figure  7.  The  obseiwatlon  that  the  phase 
profiles  actually  obtahied  are  narrower  than  such  funedons  and  are  also  somewhat 
asymmetric  reflects  the  operation  of  nonlinear  elements  In  the  processing  of  these 
stimuli.  Three  parameters  will  be  used  to  characterize  the  shape  of  the  phase 
profiles:  the  width  at  the  50%  point,  the  symmetry  of  the  profile  at  the  50%  point,  and 
the  degree  of  moduladon  of  the  firing  rate  by  phase  shifts  expressed  as  die  phase 
modulation  index  (PMl,  see  Methods).  The  width  of  the  phase  profiles  was  commonly 
between  60”  and  140“  with  a  mean  of  100’  +/-  29”.  This  is  significantly  different 
from  the  width  of  a  pure  cosine  function  (180°). 
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For  individual  neurons,  positive  and  negative  phase  shifts  did  not  result  in 
identical  changes  in  the  firing  rate  resulting  in  asymmetric  phase  profiles  (see  Fig. 

10).  This  behavior  is  quantified  by  tile  Symmetry  Index  (SI).  This  index  was  found 
by  determining  the  positive  and  negative  phase  shifts  required  to  reduce  tlie 
response  by  50%  and  then  calculating  SI  as  follows:  SI-  (<l>pos  -  ®neg)/(‘^pos  + 

Oneg)-  Values  of  SI  can  vary  from  +1  to  -1  with  a  value  of  zero  reflecting  a  symmetric 
shape  of  the  phase  profile.  Figure  8  shows  the  Symmetry  Index  for  1 1  single  and  ] 
multiple  units.  The  responses  were  neai‘ly  equally  divided  between  positive  and 
negative  values.  The  average  SI  was  -0.1  +/-0.4.  By  comparing  the  absolute  values  of 
the  positive  and  negative  asymmetries,  we  were  able  to  determine  whether  the 
neurons,  as  a  population,  favored  one  direction,  or  behaved  differently  in  one 
direction.  The  variance  and  the  absolute  value  of  tlie  mean  for  the  population  of 
neurons  wltli  a  positive  asymmetry  were  not  significantly  different  from  those  with 
a  negative  asymmetry  (see  Table  3). 

Evaluation  of  the  phase  modulation  Index  reveals  that  most  phase  profiles  in 
our  sample  exliibited  a  full  modulation  of  the  firing  rate  by  phase  shifts.  This 
modulation  is  similar  to  that  seen  for  most  R'fFs.  The  average  value  for  PMI  was 
0.77+/-0.22,  and  the  mean  and  vaidauce  were  not  significantly  different  from  those 
of  the  ripple  modulation  Index. 

In  .summary,  the  phase  profiles  revealed  that  there  is  a  strong  dependence  of 
the  responses  on  the  position  of  the  spectral  envelope  relative  to  the  CF  of  the 
neuron.  The  relatively  narrow  width  of  the  profile  and  Its  asymmetry  strongly 
suggest  the  Influence  of  nonlinear  mechanisms  sucli  as  compressive  nonlinearities 
and  nonuniformly  distributed  inhibitory  influences  that  contribute  to  the 
processing  of  spectral  envelope  Information. 

Intensity  Response  Profiles 

To  determine  RTFs  and  phase  profiles,  audiovisual  criteria  were  used  to  set  the 
Intensity  to  produce  a  strong  response.  Usually,  this  intensity  was  approximately  15 
to  25dB  above  the  response  threshold  for  the  tested  ripple  density  (ripple  pha.se  -  O’). 
To  test  the  overall  influence  of  the  stimulus  intensity  on  the  response  to  ripple 
stimuli,  rate-level  functions  were  obtained  ii..i  ripple  densities  at  or  near  tlic  best 
ripple  density  of  the  neuron  or  neuron  group.  Figure  9  shows  six  examples  of  rate- 
level  functions.  Neurons  were  classified  as  monotonic,  nonmonotonic,  or 
inconclusive.  For  nonmonotonic  neurons,  the  firing  rate  peaked  for  a  specific  "best 
intensity",  and  then  proceeded  to  decrease.  Nonmonotonic  neurons  are  defined  as 
decreasing  in  their  firing  rate  at  least  39%  with  a  15  dB  increase  in  intensity.  Since 
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different  neurons  had  different  best  intensities  the  intensity  for  some  stimuli  were 
never  increased  to  levels  large  enough  to  determine  monotonicity.  Therefore,  in 
addition  to  a  definite  decrease  in  spike  rate,  the  greatest  intensity  had  to  be  at  least  15 
dB  greater  than  the  spike  rate  that  elicited  the  best  response.  Neurons  which  did  not 
decrease  their  spike  rate  to  below  70%  for  any  intensity  and  did  have  at  least  a  15  dB 
difference  between  the  best  stimulus  intensity  and  tlie  greatest  stimulus  intensity 
were  considered  monotonic.  Neurons  that  did  not  fulfill  either  requirement  were 
classified  as  inconclusive  (i.e.  Fig  9a).  In  four  out  of  the  sbc  examples  (Fig.  9b,  c,  e, 
and  f),  the  firing  rate  for  the  highest  tested  intensity  was  less  than  70%  of  the 
maximum  firing  rate,  i.e.,  they  were  nonmonotonic.  The  neuron  shown  in  Figure  9d 
is  clearly  monotonlc.  The  firing  rate  stays  above  70%  of  the  maximum  firing  rate  for 
at  least  a  30  dB  increase  In  intensity  beyond  the  best  Intensity.  Of  the  42  units  for 
which  intensity  response  profiles  were  collected,  only  eight  had  clearly  monotonic 
responses  while  twenty-five  were  nonmonotonic,  and  nine  were  inconclusive. 
Therefore,  of  the  units  where  an  Identification  could  be  made,  24%  (8  out  of  33)  were 
monotonic,  and  the  remaining  76%  (25  out  of  33)  were  nonmonotonic.  Most  of  the 
units  also  had  a  narrow  dynamic  range,  going  from  threshold  to  a  peak  response  in 
20  dB. 

Modulation  Depth  Response  Profiles 

One  of  the  principal  descriptors  of  the  spectral  envelope  is  the  depth  of 
spectral  modulation.  In  the  ripple  stimuli  used  to  find  the  phase  and  intensity 
profiles,  and  the  RTFs,  the  modulation  depth  was  uniformly  set  to  30dB.  To  determine 
the  modulation  depth  response  profiles,  the  ripple  density,  intensity,  and  phase  were 
set,  and  the  modulation  depth  was  varied.  The  ripple  density  was  set  at,  or  near,  the 
best  ripple  density,  the  phase  was  sec  to  0",  and  the  intensity  was  the  same  as  for 
determining  the  RTF  and  phtise  response.  Then  six  to  ten  different  modulation  depths 
were  presented  between  0  and  40  dB.  An  Increase  of  the  modulation  depth 
corresponded  to  a  lowering  of  the  troughs  while  keeping  the  levels  of  the  spectral 
peaks  constant.  As  a  consequence,  an  increase  in  modulation  depth  resulted  in  a 
slight  decrease  in  total  energy  (see  Methods).  Figure  11  shows  six  examples  of 
modulation  depth  profiles.  As  shown  in  the  first  five  examples  (Fig.  11a,  b,  c,  d,  and 
e),  the  response  usually  increases  as  the  modulation  depth  increases  until  a  maximum 
or  plateau  is  reached  at  depths  ranging  from  5  to  30  dB.  As  the  modulation  depth 
increased  further,  the  firing  rate  was  fairly  independent  of  modulation  depths. 

Figure  Ilf  shows  two  modulation  depth  response  profiles  taken  for  the  same  unit. 

The  closed  circles  were  taken  using  the  best  ripple  density  of  1.0  ripple/octave  while 
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the  open  circles  used  a  ripple  density  of  3.0  ripples/oaave.  The  clear  differences 
between  the  two  profiles  indicate  that  the  depth  profile  depends  on  the  ripple  density 
used.  Figure  12  shows  the  frequency  distribution  for  the  lowest  of  the  near  optimal 
(best)  modulation  depths,  at  the  best  ripple  density  of  the  unit.  The  mean  best 
modulation  depth  was  20+/-8dB.  For  most  imits,  however,  a  modulation  depth  of  3QdB 
still  produced  near  optimal  responses.  Therefore,  this  modulation  depth  was  seleaed 
for  obtaining  most  Rl'Fs  and  other  response  profiles. 

Fundamental  Frequency  Response  Profile 

The  ripple  stimulus  used  in  this  study  consisted  of  a  series  of  harmonically 
related  components.  The  fundamental  frequency  of  the  harmonic  complex  was 
chosen  so  that  the  complex  contained  at  least  12^  components  and  maximally  256 
components.  As  a  consequence,  the  lowest  octave  of  the  3’  octave  wide  stimulus 
contained  ai  least  18  and  maximally  36  components  thereby  providing  an  adequate 
representation  of  the  sinusoidal  spectral  envelope  (See  Figure  1).  The  fundamental 
frequencies  actually  used  for  obtaining  RTTs  and  response  profiles  ranged  from 
37.5112  to  150Hz.  To  systematically  estimate  the  influence  of  variations  of  the 
fundamental  frequency  on  the  response  to  a  ripple  stimulus,  we  varied  the 
fundamental  frequency  over  a  wide  range  for  12  neurons.  The  fundamental 
frequency  response  profiles  in  Fig.  13  illustrate  that  there  were  definite  changes  in 
tlie  responsiveness  as  the  fundamental  frequency  varied.  For  higher  fundamental 
frequencies,  the  spectral  envelope  slope  is  under-sampled  especially  for  high  ripple 
densities  and  is  no  longer  well  represented.  Therefore,  it  is  reasonable  to  expect  that 
at  high  fundamental  frequencies,  there  are  fairly  large  variations  in  the  response 
magnitude.  The  reason  for  the  variations  at  low  fundamental  frequencies  (see  Fig 
13a,  c,  and  f)  is  less  clear,  and  suggests  a  reliance  of  the  response  on  the  fundamental 
frequency  Itself,  rather  than  exclusively  on  aspeas  of  the  spectral  envelope. 
Fundamentals  ranging  from  100  Hz  to  1000  Hz  (at  which  frequency  the  spectral 
envelope  is  being  grossly  under  sampled)  elicited  strongest  responses.  No  common 
pattern  for  fundamental  frequency  response  profiles  was  apparent. 

Spectral  Width  Response  Profiles 

Finally,  the  influence  of  the  spectral  width  of  the  ripple  stimulus  was 
investigated.  In  all  previous  conditions,  it  was  set  to  three  octaves.  This  width  was 
chosen  to  ensure  that  the  stimulus  covered  the  entire  receptive  field  of  the  recording 
site.  To  evaluate  the  possible  influence  of  the  stimulus  bandwidth  on  the  response  to 
ripple  spectra,  the  bandwidth  was  changed  systematically.  Figure  14  shows  six  typical 


spectral  width  response  profiles.  Two  types  of  spectral  width  response  proflles  cs 
be  distinguished:  those  that  generally  decrease  with  increases  in  bandwidth  (Fi;, 
14a-e),  and  those  that  generally  increase  (Fig.  14f).  Of  the  eleven  units  from  which 
we  recorced  the  spectral  width  response  profile,  two  (18%)  were  increasing,  and 
nine  (82%)  were  decreasing.  As  the  spectral  width  of  the  stimulus  increases,  the 
response  fairly  monotonically  varies  until  it  reaches  a  steady  state  around  a  spectral 
width  of  two  to  three  octaves,  i.e.,  wider  bands  do  not  significantly  influence  the 
response.  Therefore,  the  use  of  a  three  octave  wide  stimulus  throughout  the  study  is 
a  safe  estimate  of  a  bandwidth  that  covers  most  of  the  affected  frequency  receptive 
field. 

DISCUSSION 

This  series  of  experiments  was  conducted  to  investigate  the  response  of 
primary  auditory  cortical  neurons  to  broad-band  sounds  with  distinct  spectral 
envelopes.  In  particular,  it  was  designed  to  determine  whether  neurons  in  AI  are 
sensitive  to  specific  attributes  of  sinusoidal  spectral  envelopes,  and  which  are  the 
most  salient  features  of  spectral  envelopes  that  influence  the  cortical  response. 

We  found  that  neurons  in  the  primary  auditory  cortex  can  be  tuned  to  specific 
features  of  the  sinusoidal  spectral  envelope  of  a  complex  sound  including  the  spacing 
of  the  spectral  maxima  and  minima  and  the  depth  of  spectral  modulation  or  intensity 
contrast.  In  addition,  the  position  of  the  spectral  envelope  relative  to  the  standard 
receptive  field  is  relevant,  i.e.,  the  intensity  and  the  phase  of  the  spectral  envelope 
affect  the  response.  Carrier  signal  parameters  of  the  spectral  envelope  signal,  such 
as  bandwidth  and  spectral  density,  play  a  role  in  shaping  the  cortical  response  as 
well. 

Single-unit  versus  multiple-unit  recordings 

Before  discussing  the  results  of  this  study  in  the  context  of  auditory  coding 
str.'itegies,  a  brief  consideration  will  be  given  to  some  methodological  issues.  Since 
these  are  among  the  first  parameceric  experiments  to  explore  the  responses  of 
cortical  cells  to  characteristics  of  the  spectral  envelope,  we  chose  to  record  from  both 
single  and  multiple  neurons.  The  main  reason  was  the  relatively  large  number  of 
stimulus  parameters  that  needed  to  be  systematically  tested  for  each  recording  site. 
Since  the  isolation  of  single  units  is  more  difficult  to  maintain  over  an  extended 
period  of  time,  the  use  of  multiple  unit  recordings  yielded  a  higher  percentage  of 
locations  that  were  completely  characterized  for  single  tone,  two-tone,  and  ripple 
stimuli.  The  mean  and  variance  of  the  best  ripple  density  of  recording  sites,  the 
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shape  and  bandwidth  of  the  ripple  transfer  function,  and  the  monotonicity  of  the 
intensity  response  profile  were  found  to  be  quite  similar  for  single  and  multiple  unit 
recordings.  Consequently,  multiple  and  single  unit  results  are  discussed  conjointly. 
However,  the  distinct  differences  between  the  two  recording  methods  and  their 
potential  influence  on  the  interpretation  should  be  kept  in  mind  (for  discussion  see 
Schreiner  and  Mendelson,  1990).  Since  some  characteristics  of  pure  tone  frequency 
response  areas,  such  as  sharpness  of  tuning  (Schreiner  and  Sutter,  1992)  and 
monotonicity  of  rate/level  functions  (Sutter  and  Schreiner,  in  press)  can  show 
distinct  differences  between  the  results  from  these  two  recording  methods,  the 
similarity  of  the  results  obtained  from  single  and  multiple  units  for  these  broad-band 
stimuli  is  noteworthy.  The  similarity  between  the  group  response  and  the  element 
response  may  be  an  indication  that  more  comple:t  stimuli,  such  as  ripple  stimuli,  are 
better  suited  than  pure  tones  for  the  study  of  local  cortical  properties,  since  they  may 
engage  and  reveal  more  fully  the  cooperation  within  the  local  neuronal  population. 
Since  the  responses  were  obtained  from  the  middle  laminae,  no  clear  test  of  a 
potential  columnar  organization  was  possible.  However,  an  indication  of 
nonuniform  spatial  distributions  along  the  isofrequency  domain  was  evident 
suggesting  that  the  spatial  frequency  analysis  is  locally  and  globally  not  randomly 
distributed. 

Acoustic  stimulus:  carrier  and  spectral  envelope 

As  pointed  out  in  the  Introduction,  arguments  for  the  use  of  ripple  spectra  in 
the  exploration  of  cortical  processing  come  from  the  study  of  the  visual  cortex,  from 
recent  psychoacoustical  studies  that  focused  on  the  coding  of  spectral  envelopes,  and 
from  system-theoretical  considerations. 

Both  temporally  and  spatially  complex  stimuli  have  been  used  for  the  highly 
advanced  investigation  of  signal  coding  in  the  visual  cortex  (e.g.,  see  Maffei  and 
Fiorentini,  1973;  Albrecht  and  DeVaiois,  1981).  It  has  been  shown  in  the  visual  system 
that  certain  tuning  properties  of  a  neuron  vary  with  some  other  stimulus  properties. 
The  sharpness  of  directionai  orientation  tuning,  for  example,  is  dependent  upon 
whether  the  stimulus  is  a  long  bar  or  a  wide  bar.  It  has  also  been  suggested  that 
dividing  neurons  into  categories  based  on  response  properties  is  dependent  upon  the 
stimulus  used  (Maffei  and  Fiorentini,  1976;  Hammond  and  Munden,  1990).  Of 
particular  interest  in  this  context  are  studies  that  utilize  luminance  gratings  to 
explore  the  properties  of  visual  cortical  neurons  (e.g.,  Maffei  and  Fiorentini,  1976; 
Zhang,  1990;  Jagadeesh  et  al.,  1993,  for  review  see  DeVaiois  and  De  Valois.  1990).  By 
using  the  response  to  sinusoidal  lur.  Ji.  :e  gratings  as  a  basis  for  a  general 
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characterization,  the  response  of  many  simple  cells  to  various  visual  stimuli  has 
proven  to  be  predictable  (Campbell  and  Robson,  1968;  Worgotter  and  Eysel,  1987; 
Jagadeesh  et  al.,  1993;  De  Angelis  et  al.,  1993).  In  this  case,  stimuli  excite  a  wide  range 
of  the  receptor  surface  (the  retina)  in  contrast  to  spatially  restricted  light  points  or 
bars.  By  manipulating  the  characteristics  of  the  gratings,  a  number  of  receptive 
field  properties  can  be  evaluated  that  take  into  account  long  range  and  short  range 
spatial  influences  and  interactions.  This  approach  is  based  on  the  system-theoretical 
equivalence  of  a  system  impulse  response  and  system  transfer  or  filter  function 
applied  to  the  spatial  domain  of  the  receptor  surface.  By  presenting  different  spatial 
frequency  gratings  and  taking  the  fourier  transform  of  the  resulting  transfer 
function,  estimations  of  the  impulse  response  (the  receptive  fields)  for  simple  cells 
can  be  obtained.  Although  investigations  on  the  visual  cortex  have  had  great  success 
determining  and  explaining  receptive  field  properties  by  rigorously  applying 
pai'ametrically  accessible  stimuli  under  system-theoretical  considerations,  little  of 
that  approach  has  been  transferred  to  the  exploration  of  the  central  auditory  system. 

Early  physiological  studies  investigating  affects  of  the  spectral  envelope  on 
auditory  responses  used  a  ripple-like  stimuli,  "cosine  noise"  (de  Boer,  1967;  Evans  et 
al.  1970;  Bilsen  and  Goldstein.  1974;  Bilsen  et  al.,  1975).  By  delaying  white  noise  and 
adding  it  to  itself,  a  broad-band  stimulus  was  created  with  a  sinusoidal  spectral 
envelope,  and  a  carrier  of  noise.  The  resulting  stimulus  had  linearly  spaced  peaks 
and  a  spectral  envelope  amplitude  that  varied  linearly.  Although  ripple-like,  cosine 
noise  is  not  shift  invariant  along  either  the  frequency  or  the  intensity  axes  and, 
therefore,  is  less  suitable  fo."  our  approach. 

Later,  psychophysical  studies  used  sp<.  rtrally  complex  stimuli  to  address  the 
perception  of  spectral  profiles  (e.g.,  Bernstein  and  Green,  1988;  Berg  and  Green,  1990; 
Green  and  Berg,  1991)  and  the  influence  of  spectrally  more  distant  regions  on  the 
processing  and  perception  of  presumably  locally  derived  attributes  (e.g.,  Hall,  et  al, 
1984;  Moore  and  Shailer,  1991;  Richards  and  Heller,  1991).  The  sinusoidal  spectral 
envelope  has  been  used  in  a  few  psychoacoustical  studies  either  to  study  the  effects 
of  spearal  peak  spacings  (Houtgasc,1977;  Hillier,  1991;  Keeling  et  al.,  1993;  Shamma  et 
al.,  1994),  and  to  study  spectral  motion  after-effects  (Shu  et  al.  1993).  Testing  the 
effectiveness  of  a  ripple  stimulus  as  a  forward  or  direct  masker,  Houtgast  (1977) 
showed  psychoacoustically  that  not  only  the  spacing  of  the  peaks,  but  also  the  phase 
of  the  spectral  envelope  is  important.  Investigating  whether  there  are  segregated 
"channels"  in  the  auditory  system  for  taking  the  fourier  transform  of  the  spectral 
envelope,  Hillier  ( 1993)  found  evidence  for  independent  encoding  of  different  ripple 
densities.  While  these  studies  reveal  several  similarities  between  the  behavior  of  the 
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auditory  system,  and  that  of  the  visual  system,  when  using  comparable  stimulus 
features,  no  directly  corresponding  physiological  studies  have  been  undertaken  that 
could  account  for  the  psychophysical  observations. 

The  success  in  the  visual  system  of  elucidating  receptive  field  properties  and 
perceptual  phenomena  with  spatially  complex  stimuli  that  satisfy  certain  system- 
theoretical  constraints,  and  the  importance  of  spectral  envelopes  in  discriminating 
vocalizations,  particularly  vowels,  suggests  a  similar  investigation  of  the  spatial 
dimension  of  sounds,  relative  to  the  receptor  surface,  and  their  relationship  to 
frequency  organization.  The  results  of  such  an  approach  provide  an  expansion  of 
the  possibilities  to  characterize  neuronal  response  properties,  should  allow  an 
assessment  whether  pure  tone  receptive  fields  are  sufficient  to  predict  tlie  response 
to  spectrally  complex  signals,  and  may  reveal  new  insights  into  the  principles 
underlying  the  representation  of  complex  sounds  in  the  auditory  cortex. 

Ripple  Transfer  Functions 

Previous  studies  have  used  tonal  stimuli  to  evaluate  the  receptive  fields  of 
neurons.  Pure-tone  receptive  fields  allow  the  characteri2'.ation  of  the  response 
selectivity  of  neurons  by  extracting  descriptive  parameters  such  as  characteristic 
frequency,  minimum  threshold,  and  the  bandwidth  of  the  tonal  response.  Although 
such  a  characterization  can  be  sufficient  to  predict  the  response  of  a  peripheral 
neuron  to  a  variety  of  stimuli,  there  is  ample  evidence  that  central  neurons  have 
stimulus  selectivities  that  are  not  predictable  from  pure-tone  responses  alone. 
Examples  include  selective  responses  to  complex  sonar  signals  in  bat  auditory  cortical 
neurons  (Suga,  1965),  vocalization  specific  reurons  in  squirrel  monkeys  (Newman 
and  Wollberg,  1973a),  song  selective  neurons  in  birds  (La.ngner  et  al.,  1981;  Miiller 
and  Leppelsack,  1985),  and  syllable  sensitivity  in  human  cortical  neurons 
(Creutzfeldt  et  al.,  1989).  In  order  to  systematically  explore  potential  response 
selectivities  of  cortical  neurons  beyond  frequency  and  intensity,  other  stimulus 
dimensions  that  may  be  of  relevance  to  the  animal  must  be  utilized.  Transfer 
functions,  intensity  response  profiles,  and  phase  response  profiles  of  ripple  spectra 
provide  a  systematic  characterization  of  neuronal  responses  that  may  reveal 
additional  aspects  of  receptive  fields  while  rellecting  some  of  the  properties  seen 
with  pure-tone  responses.  Although  a  thorough  correlational  analysis  between  RTF 
properties  and  pure-. one  frequency  response  areas  will  be  presented  elsewhere,  a 
brief  discussion  on  the  concepts  and  interpretations  of  RTl-  properties  with  regal'd  to 
traditional  tuning  curves  will  be  outlined.  For  the  following  discussion  of  observed 
R'FF  properties,  it  will  suffice  to  consider  only  a  few  of  the  most  common  aspects  of 
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frequency  response  areas,  namely  the  width  and  strength  of  a  central  excitatory 
area  and  the  position  and  strength  of  flanking  inhibitory  sidebands. 

In  this  portion  of  the  study,  RTFs  were  exclusively  obtained  for  the  'cosine' 
phase  of  the  spectral  envelope,  i.e.,  a  spectral  maximum  was  located  at  the  CF  of  the 
neurons.  For  this  condition,  four  characteristics  of  RTFs  were  used  to  describe  the 
variance  in  the  obtained  sample:  the  overall  shape  of  the  RTF,  the  ripple  density 
producing  the  strongest  response,  the  bandwidth  of  the  RTF,  and  the  degree  of 
response  modulation  due  to  changes  in  the  spearal  envelope  frequency. 

The  occurrence  of  the  two  most  common  types  of  RTF  shape,  band-pass  and 
low-pass,  representing  67%  of  the  sample,  can  be  largely  attributed  to  the  presence 
of  relatively  strong  Inhibitory  sidebands  in  the  ITtAs.  With  the  center  peak  of  tire 
ripple  stimulus  aligned  with  the  CF  of  the  unit,  the  maximum  response  to  the  stimulus 
will  be  attained  when  the  neighboring  spectral  minima  arc  positioned  to  coincide 
witli  the  inhibitory  sidebands.  If  the  ripple  density  is  lowered  or  increased,  the 
Invoked  inhibition  will  increase  by  expanding  the  central  peak  into  the  sidebands  or 
by  moving  neighboring  ripple  maxima  into  the  inhibitory  sidebands,  respectively, 
thus  creating  a  band-pass  characteristic  of  the  RTF.  RTFs  with  low  pass 
characteristics  can  be  seen  when  Inhibitory  sidebands  are  absent  or  only  weak. 

Other  RTF  filter  shapes  ai‘e  probably  also  related  to  the  spacing  and  relative  strength 
of  excitatory  and  inhibitory  regions.  Asymmetries  in  the  arrangement  and  strength 
of  these  portions  of  the  FRAs  are  likely  contributing  to  the  creation  of  notch  or 
irregularly  shaped  RTFs.  It  has  been  shown  (Suga  and  Manabe,  1982;  Shamma  et  al., 
1993;  Sutter  and  Schreiner  1990),  that  the  strength  and  relative  position  of  the 
excitatory  and  inhibltoiy  regions  can  vary  substantially  between  neurons.  Thus,  the 
different  receptive  field  organizations  required  to  create  the  observed  RTFs  of  varied 
shapes  do  exist. 

Another  important  descriptor  of  RTFs  relates  to  the  position  of  the  peak  and/or 
the  upper  cut-off  along  the  ripple  density  axis.  These  aspects  of  RTFs  reflect  both  the 
width  of  the  classical  excitatory  receptive  field  and  the  distance  of  the  inhibitory 
sidebands  from  the  center  of  the  excitatory  portion.  Neurons  showing  a  preference 
for  higher  ripple  densities  are  likely  to  have  narrow  tuning  curves  and  more  closely 
spaced  inhibitory  sidebands.  By  contrast,  preferences  for  low  ripple  densities 
probably  reflect  more  broadly  tuned  neurons  that,  as  a  consequence,  have  fairly 
widely  spaced  inhibitory  sidebands.  The  best  ripple  densities  between  0.5  and  4 
ripples  per  octave,  as  seen  for  the  majority  of  neurons  in  this  study,  indicate  that 
spadngs  between  inhibitory  and  excitatory  regions  in  neurons  can  vary  between 
0.25  to  2  octaves,  consistent  with  findings  for  cortical  neurons  obtained  in  two-tone 
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experiments  (Suga  and  Manabe,  1982;  Shamma  and  Symmes,  1985;  Sutter  and 
Schreiner,  1990;  Shamma  et  al.,  1993). 

Although  it  has  been  shown  that  the  distribution  of  excitatory  regions  as  well 
as  that  of  inhibitory  side  bands  and  their  relative  strength  can  be  asymmetrical, 
(Shamma  and  Symmes,  1985;  Sutter  and  Schreiner,  1990;  Schreiner  and  Sutter,  1992; 
Shamma  et  al.,  1993),  RTFs  obtained  by  centering  the  ripple  stimulus  over  the 
characteristic  frequency  of  the  neuron  cannot  necessarily  resolve  the  direction  of 
the  asymmetry.  However,  by  varying  the  phase  of  the  spectral  envelope  for  a  given 
ripple  density  or  by  obtaining  RTFs  for  different  phases,  some  of  these  asymmetries 
can  be  investigated  (see  below). 

The  interpretive  value  of  two  other  descriptors  of  RTFs,  the  bandwidth  and  the 
modulation  index,  is  more  difficult  to  assess.  The  modulation  index  is  an  indicator  of 
the  neuron's  variation  in  activity  due  to  changes  in  the  ripple  density  of  the  spectral 
envelope.  A  high  modulation  index  suggests  a  relatively  high  efficacy  of  excitatory 
or  inhibitory  influences  of  surrounding  areas  on  the  response  strength  of  a  neuron 
and/or  a  large  change  in  the  response  strength  with  small  (~  6dB)  changes  in 
intensity.  A  low  modulation  index  suggests  relatively  weak  excitatory  and  inhibitory 
influences  from  surrounding  areas,  and/or  not  much  change  in  the  strength  of  the 
response  with  small  changes  in  intensity.  The  bandwidth  of  RTFs  also  reflects  these 
properties  but,  in  addition,  incoiporates  aspects  related  to  the  spacing  of  excitatory 
and  Inhibitory  regions  which  are  more  directly  expressed  by  the  best  ripple  density 
or  the  upper  cut-off  density. 

A  useful  feature  to  distinguish  vowels  independent  of  speaker  gender  or  age  is 
to  consider  the  frequency  ratio  of  the  formants  (e.g.  Miller,  1989).  Depending  upon 
the  vowel,  most  formants  are  spaced  between  0.3  and  4.0  octaves  with  the  majority  of 
ratios  cluster  around  1  octave.  The  best  ripple  densities  of  the  neurons  in  Al  in  this 
study  covered  a  similar  range  with  the  majority  tuned  around  1  ripple/octave. 
Preliminary  analysis  of  cat  vocalizations  (see  Figure  1)  also  indicates,  similar  to 
human  vowels,  a  preponderance  of  formant  ratios  around  1  octave. 

Taken  together,  the  characteristics  of  cosine  RTFs  reflect  a  number  of 
receptive  field  properties  that  have  been  observed  with  pure-tone  or  two-tone 
measurements  of  FRAs.  In  addition,  the  RTF  may  provide  information  that  is  usually 
not  reflected  in  FRAs,  namely  the  cumulative  effect  of  excitatory  and  inhibitory 
influences  on  the  response  strength  including  effects  from  regions  outside  the 
classical  receptive  field. 


Response  Profiles 
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The  various  parameters  of  the  ripple  stimulus  that  were  controlled  included 
the  phase  of  the  spectral  envelope,  the  modulation  depth,  the  spacing  of  the 
components,  the  bandwidth  of  the  stimulus,  and  the  overall  intensity. 

Variations  of  the  phase  of  the  spectral  envelope  invariably  had  a  large  effect 
on  the  magnitude  of  the  response.  It  is  known  from  pure  tone  stimuli  that  the 
characteristic  frequency  is  bordered  by  regions  that  are  either  inhibitory  or  non- 
responsive.  Thus,  it  would  be  expected  that  the  strongest  response  to  a  ripple 
stimulus  would  be  for  a  stimulus  with  a  maximal  amount  of  energy  near  the  CF,  and  a 
minimal  amount  of  energy  in  the  bordering  regions  -  especially  those 
corresponding  to  inhibitory  areas.  Since  one  of  the  goals  of  this  particular  stimulus 
was  to  cover  the  entire  spectral  range  which  might  influence  the  response  of  a  cell, 
the  center  peak  of  the  stimulus  was  positioned  at  the  CF  of  the  unit,  covering 
frequencies  1.5  octaves  on  either  side.  Changes  in  the  ripple  phase  for  a  fixed  ripple 
density  should  reveal  whether  this  position  was  indeed  producing  the  strongest 
response.  Only  very  few  phase  profiles  showed  secondary  peaks  or  a  peak 
significantly  shifted  from  the  0°  position.  That  is,  the  alignment  of  a  spectral 
maximum  with  the  CF  was  near  optimal.  Slight  deviations  in  the  phase  profile  of  the 
maximum  from  zero  may  be  accounted  for  by  discrepancies  between  the  BF  of  a 
neuron  at  the  stimulus  intensity  that  corresponded  to  the  local  energy  of  the  ripple 
stimulus  and  the  near  threshold  estimate  of  the  CF. 

If  the  filtering  system  reflected  in  the  RITs  were  linear  and  symmetrical,  the 
phase  response  profile  for  a  given  ripple  density  would  be  sinusoidal  (the  profiles 
would  match  the  sinusoidal  patterns  --  hatched  lines  in  Fig.  10).  The  clear  deviations 
from  the  sinusoidal  profile  in  all  the  phase  profiles  is  a  strong  indication  of  either 
nonlinear  components  or  asymmetries  in  the  filter.  Differences  in  the  symmetry  of 
the  phase  profiles  are  likely  closely  related  to  the  asymmetries  in  the  distribution 
and  strength  of  inhibitory  sidebands  as  shown  with  two-tone  stimuli  for  AI  neurons 
(Shamma  et  al.,  1993;  Sutter  and  Schreiner  1990,  1991).  Equal  numbers  of  positive  and 
negative  asymmetries  suggest  an  equal  number  of  neurons  with  stronger  high  and 
low  frequency  inhibitory  sidebands  (Shamma  et  al,  1993). 

As  the  spectral  modulation  depth  of  a  stimulus  with  fixed  and  near  optimal 
ripple  density  was  systematically  increased,  the  responses  usually  increased  up  to  a 
constant  value.  However,  on  some  occasions  responses  decreased  with  increasing 
modulation  depth.  The  most  likely  explanation  for  this  difference  in  response 
behavior  with  changes  in  modulation  depth  is  again  related  to  the  presence  and 
strength  of  influence  by  inhibitory  sidebands.  If  strong  sidebands  are  present,  an 
increase  in  the  modulation  depth  of  the  optimally  positioned  spectral  envelope  will 
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mostly  decrease  the  input  to  the  inhibitory  sidebands  while  maintaining  the  input  to 
the  central  excitatory  region.  However,  if  no  or  only  weak  inhibition  is  present  or  if 
the  spectral  envelope  is  not  optimally  positioned,  an  increase  in  the  modulation 
depth  may  remove  excitatory  energy  from  the  receptive  field  and,  thus,  reduce  the 
response  strength.  Most  neurons  showed  the  greatest  changes  in  responsiveness  for 
modulation  depths  between  5  and  25dB.  For  a  reference  on  spectral  modulation  depth 
sizes,  vowels  usually  have  spectral  modulation  depths  of  10  -  30  dB  between  formants, 
the  same  range  where  the  neurons  were  most  sensitive  to  change  in  the  modulation 
depth  of  tne  spectral  envelope. 

As  a  function  of  overall  intensity,  the  neurons  predominantly  behaved  in  a 
nonmonotonic  manner  in  response  to  ripple  stimuli.  This  behavior  contrasts  with 
that  seen  in  response  to  pure  tone  stimuli.  Using  pure  tones,  various  portions  of 
sampled  neurons  in  A1  (24-60%)  have  been  reported  to  have  shown  nonmonotonic 
behavior  (Phillips  et  al.,  1985,  1989;  Schreiner  et  al.,  1992).  The  finding  here,  that 
76%  of  the  neurons  showed  nonmonotonic  rate/level  functions  for  ripple  stimuli 
suggests  that  neurons  are  more  selective  or  more  sharply  tuned  to  specific  signal 
intensities  when  stimulated  with  broad-band  stimuli  than  with  narrow-banu  stimuli. 
Similarly,  a  high  percentage  of  nonmonotonic  rate/level  functions  have  been  found 
by  Imig  and  colleagues  (1990)  for  free-tleld  wide-band  noise  stimuli.  Both 
observations  are  consistent  with  the  hypothesis  that  the  activation  of  inhibitory 
sidebands  plays  a  major  role  in  creating  intensity  selectivity  (Phillips,  1988). 

As  was  seen,  there  are  considerable  changes  in  the  response  as  the 
fundamental  frequency  of  the  carrier  band  varies  over  a  limited  range.  As  the 
fundamental  frequency  is  varied,  the  number  of  components  per  octave  or  per 
critical  band  varies.  However,  as  long  as  the  fundamental  frequency  is  small,  the 
energy  per  octave  or  per  critical  band  remains  fairly  constant.  As  the  fundamental 
frequency  increases,  there  are  fewer  components  per  octave  or  per  critical  band.  In 
the  extreme  case,  when  there  are  only  a  few  components  per  octave,  the  spectral 
envelope  shape  will  nor  be  adequately  represented  due  to  undersampling.  At  this 
point,  the  energy  distribution  over  critical  bands  varies  greatly  with  changes  in  the 
fundamental  frequency.  Since  the  local  energy  near  the  CF  contributes 
significantly  to  the  strength  of  the  response,  it  might  be  expected  that  different  high 
fundamental  frequencies  can  produce  large  differences  in  the  response.  By  contrast, 
different  low  fundamental  frenuencies  should  produce  little,  if  any,  variation  in  the 
response  strength.  Thus,  thr  significant  variations  in  the  response  strength,  even 
between  relatively  low  fundamental  frequencies  (50  to  100  Hz),  are  not  consistent 
with  such  an  energy  distribution  hypothesis.  Alternatively,  neuronal  ’:csponses  in 
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AI  could  be  sensitive  for  specific  values  or  value  ranges  of  fundamental  frequencies. 
The  changes  in  the  response  to  ripple  spectra  with  fundamental  frequency  could, 
therefore,  be  due  to  changes  in  periodicity  pitch  that  are  encoded  at  the  cortical 
level  (Langner  et  al.,  1994). 

The  final  stimulus  parameter  that  was  systematically  varied  was  the  spectral 
bandwidth.  Using  a  spectral  width  of  3  octaves  as  the  standard  bandwidth,  we 
attempted  to  reach  the  entire  spectral  influence  sphere  of  the  neuron.  Spectral 
regions  far  outside  the  pure-tone  receptive  field  of  the  neuron  were  not  expected  to 
have  significant  effects  on  the  neuron's  response,  and  therefore,  the  stimuli  used 
were  band-limited.  However,  it  was  important  to  use  a  stimulus  that  was  broad 
enough  to  cover  tlie  whole  exdtatory/inhibltory  receptive  field.  The  strongest 
responses  were  frequently  seen  at  the  smaller  bandwidths  and  there  was  frequently 
a  significant  decline  in  the  neuron's  response  as  tlie  spectral  width  of  the  neuron 
Increased.  Presumably,  this  occuiTed  as  the  stimulus  Invaded  the  inhibitory  regions 
of  the  receptive  field.  As  the  stimulus  continued  to  spread,  its  boundaries  were 
outside  the  receptive  field,  and  further  broadening  did  not  affect  the  response  of  the 
neuron.  For  most  neurons  the  bandwidth  marking  the  transition  to  a  steady  response 
strength  was  approximately  two  to  three  octaves  suggesting  that  Influences  from 
regions  more  than  1  -  l.S  octaves  to  either  side  of  the  CF  are  minimal. 

The  phase  of  the  harmonic  components  was  chosen  to  produce  a  nearly  flat 
temporal  envelope  in  order  to  avoid  potential  effects  from  peripheral  nonlinearities 
observed  for  peaky  wave  forms  (Horst  et  al,,  1990).  Although  the  potential  Influence 
of  the  component  phase  on  the  response  to  ripple  stimuli  remains  to  be  Investigated, 
preliminary  observations  indicate  that  the  effects  will  be  small  compared  to  effects 
seen  for  variations  of  spectral  envelope  parameters. 

In  conclusion,  broad-band  stimuli  with  sinusoidal  spectral  envelopes  were 
utilized  for  analyzing  properties  of  auditory  cortical  neurons.  Tlie  results  show  that 
the  neuron's  responses  are  sensitive,  if  not  selective,  to  specific  aspects  of  the 


spectral  envelope  such  as  the  spatial  frequency,  ripple  phase,  or  ripple  depth.  Thv^ 
results  suggest  that  cortical  neurons  can  represent  the  shape  of  broad-band  stimuli 
in  the  form  of  a  spatial  fourier  analysis  of  the  spectral  envelope.  The  conclusions 
drawn  from  the  ripple  transfer  functions  are  limited  by  the  use  of  only  one  ripple 
phase.  In  order  to  completely  characterize  a  neuron's  response,  each  spectral 
envelope  frequency  must  be  presented  at  least  for  two  orthogonal  phases  (e.g.  0  and 
90  degrees)  so  that  it  analyzes  the  symmetrical  properties  of  the  neurons  response  as 
well  as  the  asymmetrical  properties.  In  addition,  any  possible  relationships  between 


txadicional  pure-tone  tuning  curves,  and  these  response  profiles  must  be 
investigated.  We  will  be  addressing  these  aspects  in  the  future. 


27 


ACKNOWLEDGEMENT 

We  thank  Drs.  D.  Keeling  and  K.  Krueger  who  participated  in  some  of  the 
experiments,  and  Dr.  G.  Langner  for  reviewing  an  earlier  version  of  the 
manuscript. 

Work  was  supported  by  a  grant  from  the  Office  of  Naval  Research 
(N00014-91-J-1317). 


Abbs,  J.H.,  and  H.M.  Sussman  (1971)  Neurophysiological  Feature  Detectors  and 
Speech  Perception:  a  Discussion  of  Theoretical  Implications.  J.  of  Speech  and 
Hearing  Research  14:23-36. 

Aitkin,  L  (1990)  The  auditory  cortex:  Structural  and  functional  bases  of 
auditory  perception.  Chapman  and  Hall,  London. 

Albrecht,  D.G.,  and  R.L  DeValols  (1981)  Striate  conex  responses  to  periodic 
patterns  with  and  without  fundamental  harmonics.  J.  Physiology  319:497-514. 

Asanuma,  A.,  D.  Wong,  and  N.Suga  (1983)  Frequency  and  Amplitude 
Representations  in  the  Anterior  Primary  Auditory  Cortex  of  the  Mustached  Bat. 

J.  Neurophys,  50(.S):  11 82-96. 

Berg,  B.G.,  and  D.M.  Green  (1990)  Spectral  weights  in  profile  listening.  J. 

Acoust.  Soc.  Am.  88(2)758-66). 

Bernstein,  L.R.,  and  D.M.  Green  (1988)  Detection  of  changes  in  spectral  shape: 
uniform  vs.  non-uniform  background  spectra.  Hearing  Research  32:157-66. 

Bllsen,  F.A.  (1977)  Pitcli  of  Noise  Signals;  Evidence  for  a  "Central  Spectrum".  J. 
Acoust.  Soc,  Ani.  61(1)150-61. 

Bllsen,  F.A.  and  J.L  Goldstein  (1974)  Pitch  of  Dichotically  Delayed  Noise  and  its 
Possible  Spectral  Basis.  J.  Acoust.  Soc.  Am.  55(2)292-6. 

Bllsen,  F.A.,  J.H.  ten  Kate,  T.J.F.  Buunen,  and  J.  Raatgever  (1975)  Responses  of 
Single  Units  in  the  Cochlear  Nucleus  of  the  Cat  to  Cosine  Noise.  J.  Acoust.  Soc. 
Am.  58(4):858-866. 

Calhoun,  B.M.  and  C.E.  Schreiner  (1993)  Spatial  frequency  filters  in  cat 
auditory  cortex.  Soc.  Neurosci.  Abstr.,  23:581.8. 

Campbell,  F.W.  and  J.G.  Robson  (1968)  Application  of  Fourier  Analysis  to  the 
Visibility  of  Gratings.  J.  Physiology  197:551-66. 

Creutzfeldt  0.,  G.  Ojemann,  and  C.  Lettlch  (1989)  Neuronal  activity  in  the  human 
lateral  temporal  lobe.  I.  Responses  to  speech.  Experimental  Brain  Research, 
77(3):451-75. 

DeAngelis,  G.C.,  I.  Ohzawa,  and  R.D.  Freeman  (1993)  Spatiotemporal 
organization  of  simple-cell  receptive  fields  in  the  cat's  striate  cortex.  II. 

Linearity  of  temporal  and  spatial  summation.  J  Neurophys.  69(4)1118-35. 

de  Boer,  E.  (1967)  Correlation  Studies  Applied  to  the  Frequency  Resolution  of 
the  Cochlea.  J.  Auditory  Research  7:209-17. 

De  Valois,  R.L.,  and  K.K.  De  Valois  (1990)  Spatial  Vision.  Oxford  University 
Press,  New  York. 

Dickson,  D.P.,  and  N.  Maue-Dickson  (1982)  Anatomical  and  Physiological  Bases 
of  Speech.  Litde,  Brown  and  Company,  Boston. 


29 

Eggermont,  JJ.  (1993)  Differential  effects  of  age  on  click-rate  and  amplitude 
modulation-frequency  coding  in  primary  auditory  cortex  of  the  cat  Hearing 
Research  65(1-2)175-92. 

Evans,  E.F.,  J.  Rosenberg,  and  J.P.  Wilson  (1970)  The  Effective  Bandwidth  of 
Cochlear  Nerve  Fibres.  J.  Physiology  207:62P-63P. 

Green,  D.M.,  and  B.G.  Berg  (1991)  Spectral  weights  and  the  profile  bowl. 

Quarterly  J.  of  Experimental  Psydiology.  a,  Human  Experimental  Psychology 
43(3)449-58. 

Hall,  J.W.,  M.P.  Haggard,  and  M.A.  Fernandes  (1984)  Detection  in  noise  by 
spectro-temporal  pattern  analysis.  J.  Acoustic  Soc.  of  America  76(1)50-56. 

Hammond,  P.,  and  I.M.E.  Munden  (1990)  Areal  influences  on  complex  cells  in 
cat  striate  cortex:  Stimulus-specificity  of  width  and  length  summation. 

Experimental  Brain  Research  80:135-47. 

Heil  P,  G.  Langner  and  II.  Schelch  (1992a)  Processing  of  frequency-modulated 
stimuli  in  the  chick  auditory  cortex  analogue:  evidence  for  topographic 
representations  and  possible  mechanisms  of  rate  and  directional  sensitivity. 

Journal  of  Comparative  Physiology,  a,  Sensory,  Neural,  and  Behavioral 
Physiology  171(5):583-600. 

Hell,  P.,  R.  Rajan,  and  D.R.F.  Irvine  (1992b)  Sensitivity  of  Neurons  in  Cat 
Primary  Auditory  Cortex  to  Frequency-Modulated  Stimuli.  II:  Organization  of 
Response  Propenies  along  the  'Isofrequency'  Dimension.  Hearing  Research 
63:135-56. 

HiUier,  DA  (1991)  Auditory  Processing  of  Sinusoidal  Spectral  Envelopes. 

Doctoral  Thesis.  Washington  University,  St.  Louis. 

Horst,  J.W.,  E.  Javel,  and  G.R.  Farley  (1990)  Coding  of  spectral  fine  structure  in 
the  auditory  nerve.  11.  Level-dependent  nonlinear  responses.  /.  Acoiist.  Soc. 

Am.  88(6)2656-81. 

Hose  B.,  G.  Langner,  and  H.  Schelch  (1987)  Topographic  representation  of 
periodicities  in  the  forebrain  of  the  mynah  bird:  one  map  for  pitch  and 
rhythm?  Brain  Research,  422(2):367-73. 

Houtgast,  T.  (1977)  Auditory-Filter  Characteristics  Derived  from  Direct- 
Masking  Data  and  Pulsation-Threshold  Data  with  a  Rippled-Noise  Masker,  J. 

Acoust.  Soc.  Am.  62(2):409-41S. 

Houtsma,  A.J.M.,  and  J.L.  Goldstein  (1972)  The  Central  Origin  of  the  Pitch  of 
Complex  Tones:  Evidence  for  Musical  Interval  Recognition.  J.  Acoust.  Soc.  Am. 
51(2):520-529. 


Imig,  T.J.,  and  H.O.  Adrian  (1977)  Binaural  Columns  in  the  Primary  Field  (Al)  of 
Cat  Auditory  Cortex.  Brain  Research  138:241-57. 


Imig,  TJ.,  Wj\.  Irons,  and  F.R.  Samson  (1990)  Single-unit  selectivity  to 
azimuthal  direction  and  sound  pressure  level  of  noise  bursts  in  cat  high- 
frequency  primary  auditory  cortex.  J.  Neurophys.  63(6)1448-66. 


30 


Jagadeesh  B,  H.S.Wheat,  and  D.  Ferscer  (1993)  Linearity  of  summation  of 
synaptic  potentials  underlying  direction  selectivity  in  simple  cells  of  the  cat 
visual  cortex.  Science  262(S141):1901-4. 

Kowalski,  N.,  H.  Versnel,  and  S.A.  Shamma  (1993)  Characteristics  of  an  anterior 
field  in  the  ferret  auditory  cortex.  Abstr.  Assoc.  Res.  Otolaryngol.  16:518. 

Langner  G,  D.  Bonke,  H.  Scheich  (1981)  Neuronal  discrimination  of  natural  and 
synthetic  vowels  in  field  L  of  trained  mynah  birds.  Experimental  Brain 
Research,  43(1):  11-24. 

Langner,  G.,  M.  Sams,  P.  Heil,  LK.  McEvoy,  R.  Hari,  and  A.  Ahonen  (1994) 
Periodicity  pitch  is  represented  topographically  in  the  human  auditory  cortex; 
Evidence  by  magnetoencephalography.  Gottingen  Neurobiology  Report  1994 
voin.  2:388. 

Maffei,  L,  and  A.  Fiorentini  (1973)  The  visual  cortex  as  a  spatial  frequency 
analyser.  Vision  Research  13:1255-67. 

Maffei,  L,  and  A.  Fiorentini  (1976)  The  unresponsive  regions  of  visual  cortical 
receptive  fields.  Vision  Research  16:1131-9. 

Mendelson,  J.R.,  and  M.S.  Cynader  (1985)  Sensitivity  of  cat  primary  auditory 
cortex  (Al)  neurons  to  the  direction  and  rate  of  frequency  modulation.  Brain 
Research  327:331-5. 

Mendelson,  J.U.,  and  K.L.  Grasse  (1992)  A  comparison  of  monaural  and  binaural 
responses  to  frequency  modulated  (FM)  sweeps  in  cat  primary  auditory  cortex. 
Exp.  Bjoin  Research  91:435-54. 

Mendelson,  J.R.,  C.E.  Schreiner,  M.S.  Sutter,  and  K.L.  Grasse  (1993)  Functional 
topography  of  cat  primary  auditory  cortex:  responses  to  frequency-modulated 
sweeps.  Exp.  Brain  Research  94:65-87. 

Merzenich,  M.M.,  P.L  Knight,  and  G.L.  Roth  (1975)  Representation  of  cochlea 
within  primary  auditory  corte-;  in  the  cat.  J.  Neurophys.  38(l):231-49. 

Middlebrooks,  J.C.,  R.W.  Dykes,  and  M.M.  Merzenich  (1980)  Binaural  response- 
specific  bands  in  primary  auditory  cortex  (Al)  of  the  cat:  topographical 
organization  orthogonal  to  isofrequency  contours.  Brain  Research  181:31-48. 

Miller,  J.D.  (1989)  Auditor^z-perceptual  Interpretation  of  the  vowel.  J  Acoustic 
Soc  America  85(5)2114-2134. 

Moore,  B.C.J.  (1982)  An  Introduction  to  the  Psychology  of  Hearing.  Academic 
Press:  New  York. 


ore,  B.C.J.,  and  M.J.  Shailer  (1991)  Comodulation  masking  release  as  a 
ccion  of  level.  J  Acoustic  Soc  America  90(2)829-35. 


31 

Narins,  P.M.,  E.F.  Evans,  G.F  Pick,  and  J.P.  Wilson  (1979)  A  comb-filtered  noise 
generator  for  use  in  audiotry  neurophysiological  and  psychophysical 
experiments.  leee  Transactions  in  Biomedical  Engineering  26(1)43-7, 

Newman,  J,D.,  and  Z.  Wollberg  (1973a)  Multiple  coding  of  spedes-specific 
vocalizations  in  the  auditory  cortex  of  squirrel  monkeys.  Brain  Research  54: 

287-304. 

Newman,  J.D.,  and  Z.  Wollberg  (1973b)  Responses  of  single  neurons  in  the 
auditory  cortex  of  squirrel  monkeys  to  variants  of  a  single  call  type.  Exp. 

Neurology  40:821-4. 

Pantev,  C,,  M.  Hoke,  B.  Lutkenhoner,  and  K.  Lehnertz  (1989)  Tonotopic 
organization  of  the  auditory  cortex:  pitch  vs.  frequency  representation. 

Science  246(4929):  486-488. 

Phillips,  D.P.  (1988)  Effect  of  tone-pulse  rise  time  on  rate-level  functions  of  cat 
auditory  cortex  neurons:  excitatory  and  inhibitory  processes  shaping 
responses  to  tone  onset.  J Neurophysioiogy  59(5U524-39. 

Phillips,  D.P.,  and  S.E.  Hall  (1987)  Responses  of  single  neurons  in  cat  auditory 
cortex  to  time-varying  stimuli:  linear  amplitude  modulations.  Exp.  Br. 

Research  67:479-92. 

Phillips,  D.P.,  S.E.  Hall,  and  J.L  Hollett  (1989)  Repetition  rate  and  signal  level 
effects  on  neuronal  responses  to  brief  tone  pulses  in  cat  auditory  cortex,  J. 

Acoust.  Soc.  Am.  85(6)2537-2549. 

Phillips,  D.P.,  and  D.R.F.  Irvine  (1981)  Responses  of  single  neurons  in 
physiologically  defined  primary  auditory  cortex  (Al)  of  the  cat:  frequency 
tuning  and  responses  to  intensity.  J,  Nenrophys.  45(1)48-58. 

Phillips,  D.P.,  S.S.  Orman,  A.D.  Musicant,  and  G.F.  Wilson  (1985)  Neurons  in  the 
cat's  primaiy  auditory  cortex  distinguished  by  their  responses  to  tones  and 
wide-spectrum  noise.  Hearing  Research  18:73-86. 

Raggio,  M.,  and  C.E.  Schreiner  (1994)  Neuronal  responses  in  cat  primary 
auditory  cortex  to  electrical  cochlear  stimulation:  I.  Intensity  dependence  of 
firing  rate  and  response  latency.  J.  Neurophys.  (in  press) 

Rajan,  R.,  L.M.  Aitkin,  D.R.  Irvine,  and  J,  McKay  (1990a)  Azimuthal  sensitivity 
of  neurons  in  primary  auditory  cortex  of  rats.  I.  Types  of  sensitivity  and 
effects  of  variations  in  stimulus  parametres.  J.  Neurophys.  64(3)872-87. 

Rajan,  R.,  LM.  Aitlcin,  and  D.R.  Irvine  (1990b)  Azimuthal  sensitivity  of  neurons 
in  primary  auditory  cortex  of  cats.  II.  Organization  along  frequency-band 
strips.  J.  Neurophys.  64(3)888-902. 

Reale,  RA.,  and  T.J.  Imig  (1980)  Tonotopic  Organization  in  auditory  Cortex  of 
the  Cat.  J.  Comp.  Neurology  192(2)265-91. 

Richards,  V.M.,  and  LM.  Heller  (1991)  The  detection  of  a  tone  added  to  a  narrow 
band  of  noise:  The  energy  model  revisited.  Quarterly  J.  of  Exp.  Psych.  43:481- 
501. 


Rossing,  T.D.  (1990)  The  Science  of  Sound.  Addison-Wesley  Publishing 
Company,  Reading,  Massachusetts. 

Schreiner,  C.E.  (1991)  Functional  Topographies  in  the  primary  audtiory  cortex 
of  the  cat.  Acta  Oto-laryngologica  supp.  491:  7-16. 

Schreiner,  C.E.,  B.  Calhoun,  and  D.  Keeling  (1993)  Physiology  and  topography 
of  cortical  neurons  explored  with  vowel-like  ripple-spectra.  Abstr.  Assoc.  Res. 
Otolaryngol.  16:161. 

Schreiner,  C.E.,  J.R.  Mendelson,  and  M.L  Sutter  (1992)  Functional  topography 
of  cat  primary  auditory  cortex:  representation  of  tone  intensity.  Exp.  Brain 
Res.  92(1):  105-122. 

Schreiner,  C.E.,  and  J.R.  Mendelson  (1990)  Functional  topography  of  cat 
orimary  auditory  corte'c  distribution  of  integrated  excitation.  J.  Neurophys. 
64(5):1442-59. 

Sclirelner,  C.E.,  and  M.L  Sutter  (1992)  Topography  of  excitatory  bandwidth  in 
cat  primary  auditory  cortex:  single-neuron  versus  multiple-neuron 
recordings,  j.  Neurophysiology  68(5)1487 -1502. 

Schi’einer,  C.E.,  and  J.V.Urbas  (1988)  Representation  of  amplitude  modulation 
in  the  auditory  cortex  of  the  cat.  II;  Comparison  between  cortical  fields. 

Hearing  Research  32:49-64. 

Shamma,  SA.,  J.W.  Fleshman,  P.R.  Wiser,  and  H.  Versncl  (1993)  Organization  of 
response  areas  in  ferret  primary'  auditory  cortex.  J.  Neurophysiology  89(2): 
367-383. 

Sli.unma  S.A.  and  D.  Symmes  (198S)  Patterns  of  inhibition  in  auditory  cortical 
ceils  in  awake  squirrel  monkeys.  Hearing  Research,  19(1):  1-13. 

Shamma,  SA.,  H.  Versnel,  and  N.  Kowalski  (1994)  Responses  to  rippled  complex 
sound  stimuli  in  primary  auditory  cortex.  Abstr.  Assoc.  Res.  Otolaryngol.  17:86. 

Shu,  Z.G.,  N.V,  Swindale,  and  M.S.  Cynader  (1993)  Spectral  motion  produces  an 
auditory  after-effect.  Nature  364(6439)721-3. 

Suga,  N.  (1977)  Amplitude  spectrum  representation  in  the  doppler-shifted-CF 
pro  .  ssing  area  of  the  auditory  coitex  of  the  mustache  bat.  Science  196:64-7. 

Suga,  N.  (1965)  Functional  propei  cies  of  auaitory  neurones  in  the  cortex  of 
erho-locadng  bans.  J.  Physiol.  181:671-700. 

Suga,  N.,  :<'id  P.H.S.  Jen  (1976)  Disproportionate  tonotopic  representation  for 
processin,g  C1'-"M  sonar  signals  in  the  mustache  bat  auditory  coitex.  Science 
194:542-4. 

Suga  N.  and  T.  Manabe  (198?)  Neural  basis  of  amplitude-spectrum 
representation  in  auditoiy  concx  of  the  mustached  bat.  Journal  of 
Netncphysiology  47  ( 7 ) ;  2  2  5-5  5 . 


Srtter,  M.L  a.nu  C.E.  Schreiner  (1990)  Two-tone  responses  of  single 
units  in  cat  prirnai-y  auditory'  cortex.  AUO  abstracts  13:221. 


33 

Sutter  M.L  and  C.E.  Schreiner  (1991)  Physiology  and  topography  of  neurons 
with  multipeaked  tuning  curves  in  cat  primary  auditory  cortex.  Journal  of 
Neurophysiology,  65(5):i207'26. 

Sutter,  M.L  and  C.E.  Schreiner  (in  press)  Topography  of  intensity  parameters 
in  cat  primary  auditory  cortex:  single-neuron  versus  multipie-neuron 
recordings.  J.  Neurophysiology. 

VVightman,  F.L.  ( 1973a)  Pitch  and  stimulus  fine  structure,  J.  AcousL  Soc.  Am. 
54(2):397-406. 

Wightman, .  ' ,  (1973b)  The  pattern-transformation  model  of  pitch.  /,  Acoust. 

Soc.  Am.  54(^):407-4l6. 

Winter,  P.,  H.H.  Funkenstein  (1973)  Response  properties  of  auditory  cortical 
cells.  Brain  Research  31  (2):3()S. 

Woolsey,  C.N.,  and  E.M.  Walzl  (1942)  Topical  projection  of  the  nerve  fibers  from 
local  regions  of  the  cochlea  in  the  cerebral  cortex  of  the  cat.  Bull.  Johns 
Hopkins  Hosp.  71:315-44. 

V7orgotter,  F.,  and  U.  Th.  Eysel  (1987)  Quantitative  determination  of 
orientational  and  directional  components  in  the  response  of  visual  cortical 
cells  to  moving  stimuli.  Biological  Cybernetics  57:349-55. 

Zhang,  J.  0.990)  How  to  unconfound  the  directional  and  orientational 
info)*mation  in  visual  neuron's  response.  Biological  Cybernetics  63(2)  135-42. 


34 

Tabk  1 

Hypothesis:  There  is  no  difference  between  the  RTFs  for  single  units  and 
multiple  units  in  terms  of  their  best  ripple  density,  bandwidth  at  50%,  and 
modulation  index  for  an  a  -  0.1. 
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Conclusion:  the  best  ripple  density  and  the  bandwidth  are  not  from 
significantly  different  populations.  Although  the  RMl  are  from  different 
populations  (significantly  different  v.iriances),  the  means  are  not 
significantly  different. 


Iahk.2 

Hypothesis:  There  is  no  difference  between  the  RTF  filter  shapes  for 
single  and  multiple  units  for  an  a  -  0.1. 
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test  stadsdc  -  2.3;  critical  value  -  13.3  (for  a  >  0.1,  d.f.  -  8) 
Conclusion:  the  hypothesis  is  not  rejected. 


I2hk^ 

Hypothesis:  There  is  no  difference  between  the  positive  and  negative 
asymmetries  for  an  a  -  0.1. 
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test  statistic  =  0.28;  critical  value  -  1.8  (for  a  >  0.1,  d.f.  -  10) 
Conclusion:  the  hypothesis  is  not  rejected 
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latle^ 

Hypothesis:  There  is  no  difference  between  the  monotonicity  and 
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test  statistic  -  3.88;  critical  value  -  7,77  (for  u  >  0.1,  d.f.  -  4) 
Conclusion:  the  hypothesis  is  nc  rejected 
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Fig.  1:  Vocalizations:  a)  Schematized  spectral  envelope  of  a  human  vowei 
vocalization.  The  three  formants  are  marked,  b)  Aaual  spectral  envelope  of  a 
cat  vocalization.  The  two  regions  of  highest  energy  are  marked.  Note  tliat  the 
two  peaks  are  nearly  one  octave  apart. 


Fig.  2:  Ripple  Stimulus:  a)  Standard  stimulus:  harmonic  series  of  100  to  200 
components  linearly  spaced  by  the  fundamental  frequency  (Fq),  three  octaves 
wide  (BW),  ripple  density  (spacing  of  peaks)  of  1  ripple/octave  (RD), 
modulation  depth  (amplitude  of  waveform)  30  dB  (MD),  and  a  6dB  roll-off  to 
maintain  constant  energy  per  octave  (T).  b)  Standard  stimulus  except  ripple 
density  has  been  changed  to  2  ripples/octave,  c)  Standard  stimulus  except 
modulation  depth  has  been  changed  to  10  dB.  Note  that  the  amplitude  maxima 
are  the  same,  but  the  minima  have  changed. 

Fig.  3:  Placement  of  RipplL.  Stimu)”-  \  schematic  of  a  tuning  ciorve 
shows  the  ripple  stimulus  positioned  so  that  the  center  peak  of  the  ripple 
stimulus  is  aligned  with  the  characteristic  frequency  of  the  tuning  curve.  The 
tuning  curve  shows  both  the  excitatory  region  (light  gray),  and  the 
inhibitory  regions  (darker  gray).  With  the  center  peak  of  the  stimulus 
aligned  with  the  CF  of  the  unit,  the  phase  of  the  spectral  envelope  is 
considered  0*. 

Fig.  4;  Ripple  Transfer  Functions:  a)  Post  stimulus  time  histograms 
showing  the  responses  of  units  at  different  ripple  densities.  The  number  of 
responses  and  the  ripple  density  is  shown  next  to  each  plot.  Although  the 
stimuli  were  presented  in  a  pseudorandom  order,  they  have  been  ordered  here 
for  ease  in  interpretation,  b)  The  ripple  transfer  function  plots  the  number  of 
spikes  at  each  ripple  density  against  the  ripple  density.  The  open  circles 
illustrate  one  set  of  ten  pseudorandom  stimtili,  the  closed  circles  represent  a 
second  set.  The  line  is  a  two  point  average  of  the  points,  c)  A  ripple  transfer 
function  shows  that  the  best  ripple  density  is  1.5  ripples/octave,  the  maximum 
response  is  50  spikes,  and  the  minimum  response  is  7  spikes.  The  bandwidth  is 
the  ripple  densities  over  which  the  response  is  greater  than  half  way  between 
the  maximum  and  minimum  response:  in  this  case,  2.8  -  1.0,  or  1.8 
ripples/octave.  The  modulation  index  (RMI)  is  the  difference  between  the 
maximum  and  the  minimum,  divided  by  the  maximum:  RMI  -  (50  -  7)/50,  -  0.86. 

Fig.  5:  Ripple  Transfer  Functions:  Twelve  different  ripple  transfer 
functions  showing  typical  shapes.  The  plots  were  determined  by  using  ripple 
stimuli  with  modulation  depths  of  30  dB.  The  fundan  ental  frequency  was 
constant  for  each  plot,  a  - 1)  Typical  band-pass  filters  with  the  best  ripple 
density  ranging  from  0.5  to  5.0  ripples/octave,  g)  Notch  filter  h)  Low  pass 
filter  i)  Double  band  pass.  These  are  counted  as  band  pass,  and  the  best  ripple 
density  is  the  overall  best  response,  j)  Band  pass  filter,  k)  High  pass  filter. 

The  response  at  the  highest  ripple  densities  never  fell  below  50%  of  the 
maximum  response.  1)  A  mixed  response  or  all-pass. 

Fig.  6;  Frequency  Distributions;  a)  Filter  Types:  Histograms  show  the 
number  of  units  that  are  classified  in  each  particular  filter  type  for  single 
units  and  multiple  units.  There  was  no  significant  difference  between  the  two 
populations,  b)  Best  Ripple  Density:  Histograms  showing  the  number  of  units 
that  have  each  of  the  best  ripple  densities  for  single  units  and  multiple  units. 

The  bin  size  is  0.3  ripples/octave.  There  was  no  significant  difference 
between  the  mean  and  variance  for  these  two  populations,  c)  Bandwidth  of 


1 


RTFs;  Histograms  showing  the  number  of  units  having  the  specified  RTF 
bandwidths  for  single  units,  and  multiple  units.  The  bin  size  is  0.25 
ripples/octave,  d)  Index  of  Modulation;  Histograms  showing  the  number  of 
units  having  the  specified  index  of  modulation  for  single  units,  and  multiple 
units.  The  bin  size  is  0.05. 

Fig.  7;  Phase  Shifts;  Responses  of  six  single  units  as  the  phase  of  the 
spectral  envelope  is  varied.  A  phase  shift  of  zero  corresponds  to  the  center 
peak  of  the  stimulus  being  aligned  with  the  CF  of  unit.  The  dotted  line  is  a 
sinusoid  fit  between  the  maximum  and  the  minimum  response.  The  modulation 
depth  for  all  the  stimuli  was  30  dB.  The  ripple  densities  were  as  follows:  a)  rd  - 
1.0  b)  rd  «=  1.0  c)  rd  -  1.  d)  rd  -  1.8.  e)  rd  -  1.8.  f)  rd  -  1.7. 

Fig.  8;  Frequency  Distribution  for  Phase  Symmetry  Index:  To 
determine  the  symmetry  of  a  phase  response  profile,  the  symmetry  index 
considers  the  phase  required  to  reduce  the  maximum  response  by  50%  as  the 
phase  is  shifted  in  the  positive  and  negative  directions.  The  Index  is  (<I>  pos  - 
<t>neg)/(®  pos  +  <lYieg)-  The  light  hatching  indicates  the  phase  asymmetry  for 
1 1  single  units,  and  the  dark  hatching,  for  1  multiple  unit.  The  bin  size  is  0.1. 

Fig.  9:  Intensity  Shifts:  Responses  of  units  as  a  function  of  the  overall 
intensity.  The  stimuli  all  had  a  modulation  depth  of  30  dB,  and  the  following 
recording  site  characteristics  and  ripple  densities:  a)  single  unit,  rd  -  1.0,  b) 
single  unit,  rd  -  1.0,  c)  single  unit,  rd  =  1.0,  d)  single  unit,  rd  -  0.9,  e)  multiple 
unit,  rd  -  0.9,  f)  single  unit,  rd  -  1.0. 

Fig.  10:  Frequency  Distribution  of  Dynamic  Range  of  Units:  The 
positive  slope  of  the  intensity  response  profile  indicates  the  dynamic  range  of 
the  neurons.  To  normalize  the  response,  the  slope  was  divided  by  the 
difference  between  the  maximum  and  minimum  number  of  spikes.  The  dark 
hatching  represents  28  single  units  while  the  light  hatclung  represents  15 
multiple  units.  The  bin  size  is  0.01. 

Fig.  11:  Modulation  Depth  Response  Profiles:  Responses  of  six  typical 
units  as  the  modulation  depth  of  the  stimulus  is  varied.  Details  of  the  stimulus 
and  recording  site  were  a)  single  unit,  rd  -  1.0,  b)  multiple  unit,  rd  -  1.0,  c) 
single  unit,  rd  -  1.0,  d)  single  unit,  rd  -  0.33,  e)  single  unit,  rd  -  1.5,  f)  multiple 
unit.  (Closed  circles-  rd  -  1.0  ripples/octave;  open  circles;  rd  -  3.0 
ripples/octave.) 

Fig.  12:  Frequency  Distributions  for  Modulation  Depth:  The  histogram 
shows  the  frequency  distribution  for  preferences  in  the  modulation  depth. 

The  lightly  shaded  bars  reoresent  single  units  (n  =  6),  while  the  darkly  shaded 
bars  represent  multiple  units  (n  =  10). 

Fig.  13:  Fundamental  Frequency  Response  Profiles:  Change  of 
response  magnitude  at  best  ripple  density  as  a  function  of  fundamental 
frequency.  The  CFs  for  these  neurons  ranged  from  4.6  kl-Iz  to  7.5  kHz.  Details 
of  the  recording  site  and  the  stimuli:  a)  multiple  unit,  cf  =»  4.6  kHz,  rd  -  1.0,  b) 
multiple  unit,  cf  ■=  5.5  kHz,  rd  =  3.0,  c)  multiple  unit,  cf  -  5,5  kHz,  rd  -  1.0,  d) 
multiple  unit,  cf  -  6.9  kHz,  rd  -  1.0,  e)  sing'e  unit,  cf  -  6.5  kHz,  rd  -  1.5,  f)  single 
unit,  cf  =  7.5  kHz,  rd  =  0.33  ripples/octave. 

Fig.  14:  Spectral  Width  Response  Profiles;  Sbc  typical  spectral  width 
response  profiles  ai’e  shown,  a)  single  unit,  rd  =  1.66,  b)  single  unit,  rd  -  1.0,  c) 


multiple  unit,  rd  -  1.0,  d)  single  unit,  rd  -  1.0,  e)  multiple  unit,  rd  -  1.0,  f)  single 
unit,  rd  -  5.0  ripples/octave. 
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SUMM^OIY  AND  CONCLUSIONS 

(1)  The  spatial  distributions  of  amplitude  tuning  (monotonicity  of  rate-level  functions) 
and  response  threshold  of  single  neurons  were  studied  along  the  dorsoventraJ,  extent  of 
cat  primary  auditory  cortex  (Ai).  To  pool  data  across  animals,  the  multiple  imit  map  of 
monoionicity  was  used  as  a  frame  of  reference.  Amplitude  selectivity  of  multiple  units  is 
known  to  vary  systematically  along  isofrequency  contours,  which  run  roughly  in  the 
dorsoventral  direction.  Clusters  sharply  tuned  for  intensity  (i.e.,  "uon-monotonic" 
clusters)  are  located  near  the  center  of  the  contour.  A  second  non-monotonic  region  can 
be  found  several  millimetcns  dorsal  to  the  center  (Schreiner  et  al.  1992).  The  locations  of 
these  two  non-monotonic  regions  were  used  as  reference  points  to  normalize  data  across 
animals.  Additionally,  to  compare  this  study  to  sharpness  of  frequency  tuning  results 
(Schreiner  and  Mendelson  1990;  Schreiner  and  Sutter  1992;  Heil  et  al.  1992b),  multip  ' 
unit  bandwidth  (BW)  maps  were  also  used  as  references  to  pool  data. 

(2)  The  multiple  unit  amp!  dude- related  topographies  rec  orded  in  previous  studies  were 

confiimcd.  Fooled  multiple  unit  maps  closely  approximated  the  previously  reported 
individual  case  maps  (Schreiner  et  al.  1992)  when  the  multiple  unit  m,onotonicity  or 
BW40dB  used  as  the  pooling  reference.  When  the  BWio^jg  map  was  used  as 

part  of  the  measure,  the  pooled  spatial  pattern  of  multiple  unit  activity  was  degraded. 

(3)  Single  neurons  exhibited  non-monolonic  rate-leve)  functions  more  frequently  than 
multiple  units.  While  common  in  single  neuron  recordings  (28%),  strongly  non¬ 
monotonic  recordings  (firing  rates  reduced  by  more  than  50%  at  high  intensities)  were 
uncommon  (8%)  in  multiple  unit  recordings.  Intermediately  non-monotonic  neurons 
(firing  rates  reduced  between  20%  and  50%  at  high  intensities)  occurred  with  nearly 
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equal  probability  in  single  neuron  (28%)  and  multiple  unit  (26%)  recordings.  The 
remaining  recordings  for  multiple  units  (66%)  and  single  units  (44%)  were  monotonic 
(firing  rates  within  20%  of  the  maximum  at  the  highest  tested  intensity). 

(4)  In  vcnv.ral  AI  (AIv),  the  topography  of  monotonkity  for  single  units  was  qualitatively 
similai'  to  multiple  units,  albeit  single  units  were  on  average  more  intensity  selective.  In 
dorsal  AI  (Aid),  we  consistently  found  a  spatial  gradient  for  sharpness  of  intensity  tuning 
for  multiple  units;  however,  for  pooled  single  units;  in  Aid  there  was  no  clear 
topographical  gradient. 

(5)  Response  (intensity)  thresholds  of  single  neurons  were  not  uniformly  distributed 
across  tlie  dorsovenlral  extent  of  AI.  The  most  sensitive  neurons  were  consistently 
located  in  the  non-monotonic  regions.  The  scatter  of  single  neuron  intensity  threshold 
was  smallest  at  these  locations  and  increased  gradually  toward  more  dorsal  and  ventral 
locations. 

(6)  The  existence  of  a  specialized  region  for  near-threshold  stimuli  along  the  AIv/AId 
border  is  revealed  in  these  experiments.  Neuronal  recordings  in  this  region  are  sharply 
tuned  for  frequency  aiid  amplitude,  have  low  intensity  thresholds,  have  low  scatter  in 
characteristic  frequency  (Cr^  and  threshold,  and  selectively  respond  to  narrow-band 
stimuli  within  40  dB  of  the  cortical  intensity  threshold.  Non-monotonic  neurons  have 
been  shown  to  shift  tlieir  spike  count  vs.  level  functions  linearly  in  response  to  a 
continuous  noise  masker  (Phillips  et  al.  1985;  Phillips  1990).  Neurons  in  the  ventral  non¬ 
monotonic  region,  thus,  might  serve  as  fine  spectraPamplitude  filters  which  only  respond 
to  frequency-banded  components  v/ith  intensities  just  above  the  cat's  threshold  in  the 
presence  of  background  noise. 


iii 
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(7)  The  results  of  this  study  support  the  parceling  of  AI  into  at  least  two  physiologically 
distinct  sub-divisions.  The  ventral  sub-division  (AIv)  has  a  complete  single  unit 
topographical  representation  of  stimulus  intensity.  Low  intensity  signals  elicit  maximal 
response  at  a  signal  detection  region,  located  at  the  dorsal  extreme  of  AIv  at  the  AIv/ Aid 
border.  Neurons  respond  better  to  higher  intensity  signals  progressively  ventral  until  the 
All  border  is  approached.  The  dorsal  sub-division  (Aid)  is  well  suited  for  differential 
frequency  analysis  and  contains  a  single  unit  topography  for  stimulus  bandwidth  as 
previously  reported  (Schreiner  and  Sutter  1992). 


iv 
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INTRODUCTION 


Understanding  the  fuvidamental  patterns  of  representations  of  stimulus  frequency 
and  amplitude  are  requisite  for  understanding  sound  analysis  in  the  auditory  system.  For 
mammals,  the  topographic  representation  of  characteristic  frequency  (CF)  has  been  well 
established  in  primary  auditory  cortex  (AI)  (Merzenich  ct  al.  1975;  Reale  and  Imig 
1980).  Roughly  ordiogonal  to  the  rostrocaudal  tonotopic  gradient  in  the  cat,  sharpness  of 
frequency  tuning  i,i  topographically  represented  (Schreiner  and  Cynader  1984;  Schreiner 
and  Mendelson  1990;  Schttiiner  and  Sutter  1992;  Heil  et  al.  1992b).  Multiple  unit 
clusters  are  sharply  tuned  approximately  at  th.e  center  of  dorso-ventral  oriented 
isofrequency  contours.  Tuning  of  neuron  clusters  becomes  progressively  broader  in  the 
dorsal  and  ventral  portions  of  AI. 

Neurons  in  cat  a  iditory  cortex  that  mspond  weakly  to  high  intensity  -stimuli  'nave 
been  described  since  the  earliest  microelectrode  studies  of  AI  (Erulkar  et  al.  1956;  Evans 
and  Whitfield  1964).  The  firing  rate  of  such  neurons  increases  above  threshold  over 
approximately  a  10  to  40  dB  range,  then  decreases  with  further  elevation  of  stimulus 
intensity  (Brugge  et  al.  1969;  Phillips  and  Irvine  1981;  Phillips  et  al.  1985;  Phillips  and 
Hall  1987).  The  term  "non-monotonic"  has  been  used  to  characterize  these  neuronal 
responses  because  the  general  behavior  of  firing  rate  as  a  function  of  stimulus  intensity  i-s 
non-monotonic  (Greenwood  and  Murayama  1965).  Phillips  and  colleagues  (1985)  have 
reported  that  most  non-monotonic  neurons  are  completely  unresponsive  at  high 
intensities,  and  thus,  their  excitatory  frequency  tuning  curves  can  be  described  as  being 
circumscribed. 

While  earlier  studies  provided  preliminary  evidence  of  spatially  systematic 
representation  of  non-monotonicity  in  AI  (Reaie  et  al.  1979;  Phillips  et  al.  1985),  a 
detailed  description  of  topographic  s  of  amplitude  characteristics  has  only  recently  been 
published.  Multiple  unit  mapping  experiments  by  Schreiner  et  al.  (1992)  have 
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demonstrated  spatial  distributions  of  best-level,  threshold,  and  monotonicity  (the 
sharpness  of  amplitude  tuning)  in  cat  AI.  Near  and  overlapping  the  region  that  is  sharply 
tuned  fof  frequency  in  AI,  there  is  a  strongly  non-monotonic  region.  In  multiple  unit 
studies,  a  second  non-monotonic  region  can  be  located  in  the  dorsal  third  of  AI  (e.g.,  Fig. 
1),  tdihough  the  extent  aiid  location  of  the  dorsal  non-monotonic  region  varies 
substantially  across  animals. 


[Fig.  1  about  here] 

To  date,  intensity  and  frequency  maps  have  been  obtained  using  multiple  unit 
techniques  (Schreiner  and  Mendelson  1990;  Schreiner  et  al.  1992;  Hdl  et  al.  1992a,b). 
Deriving  single  neuron  topographies  is  substamiaiiy  more  difficult.  Single  neuron 
experiments  yield  fewer  recorded  units  because  of  the  difficulties  cncounteiing  and 
holding  single  neurons  Resulting  single  unit  maps,  thus,  either  have  less  dense  sampling 
of  the  mapped  area,  a  smaller  mapped  extent,  or  a  smaller  set  of  parameter  values  tlian 
multiple  unit  maps.  Previous  attempts  to  pool  single  neuron  data  based  on  anatomical 
landmarks  yielded  results  tliat  topographically  were  only  roughly  similar  to  multiple  unit 
studies  (Evans  and  Whitfield  1964,  Goldstein  et  ai.  1970).  For  example,  the  well- 
established  tonotopic  organization  of  AI  was  substantially  degraded  when  such  a  pooling 
technique  was  applied  (Goldstein  et  1970).  Recent  evidence  indicates  that  pooling 
data  ba.<;ed  on  physiological  landmarks  is  more  useful  and  appropriate  (Sutter  and 
Schreiner  1991;  Schreiner  and  Sutter  1992).  Comparing  single  neuron  and  multiple  unit 
topography  is  necessary  to  understand  a  number  of  cortical  properties.  First,  we  can  gain 
an  estimate  of  the  response  diversity  of  the  neural  cleracnts  contributing  to  local 
processing.  We  also  can  asses  how  single  units  contribute  to  topographies  observed  for 
multiple  unit  Finally,  we  can  use  the  interrelationship  of  multiple  and  single  unit 
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topographies  to  try  to  predict  topographical  responses  to  complex  stimuli  by 
understanding  the  local  integration  of  response  properties  of  single  units. 

In  an  earlier  paper  (Schreiner  and  Sutter  199?.),  single  neuron  and  multiple  unit 
shaipness-of-frequency  tuning  maps  were  compared  in  AI  of  the  cat  at  depths  between 
600  and  1000  microns  below  the  cortical  surface.  In  dorsal  AI  (Aid)  multiple  unit  and 
single  unit  topography  of  BW4Q  jg,  (Q40dB)  similar.  However,  in  ventral  AI 

(Alv),  almost  all  single  units  were  sharply  tuned  at  40  dB  above  threshold,  even  thiough 
more  ventrally  located  mulviple  units  showed  a  tendency  towards  broader  tuning.  A  non- 
uniform  distribution  of  local  CF  scatter  across  AI  was  described  tiiat  contributed  to  the 
differences  between  single  and  multiple  unit  responses  in  Alv. 

In  this  paper,  single  and  multiple  unit  topographies  for  the  representations  of 
sharpness  and  sensitivity  of  amplitude  tuning  arc  compared  in  AI  of  the  cat.  Multiple  unit 
maps  of  sharpness  of  frequency  tuning  and  monotonicity  were  used  to  pool  single  neuron 
data  across  several  animals.  Similar  to  the  study  of  excitatory  bandwidth,  topographical 
differences  in  integrative  mechanisms  along  the  isofrequency  axis  of  AI  aj.'e  described. 
Preliminary  results  from  this  study  have  appeared  in  abstract  form  (Sutter  and  Schreiner 
1991b). 
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METHODS 

Surgic  jl  Preparation 

Results  presented  in  this  and  a  previous,  related  report  (Schreiner  and  Sutter 
1992)  were  obtained  in  the  right  heiTiispheres  of  8  young  adult  cats.  Surgical 
preparation,  stimulus  delivery  and  recording  procedures  for  this  report  are  the  same  as 
those  from  a  previous  study  (Schreiner  and  Sutter  1992) 

Bdefly,  anesthesia  was  induced  with  an  intramiiscu!.ar  injection  of  ketamine 
hydrochloride  (10  mg/kg)  and  acctyiproniai«rie  maleatc  (0.28  mg/kg).  After  venous 
cannulation,  an  initial  dose  of  sodium  pentobarbiul  (30  mg/kg)  was  administered. 
Animals  were  maintained  at  a  surgical  level  of  anesthesia  with  a  coatimious  infusion  of 
sodium  pentobarbital  (2  mg/kg/hour)  in  lactated  Ringer's  solution  (infusion  volume:  3.5 
irJ/hour)  and,  if  necessary,  with  supplementary  iniravenous  injections  of  sodium 
pentobarbital.  The  cats  were  also  given  de.\aK;ethasone  sodium  phosphate  (0.14  mg/kg, 
IM)  to  prevent  brain  edema,  and  atropine  sulfate  (1  mg,  IM)  to  reduce  salivation.  The 
teraperatuic  of  the  animals  was  monitored  witii  a  rectal  temperature  probe  and 
maintained  at  37.5^  C  by  means  of  a  heated  water  blanket  with  feedback  control. 

Three  point  head  fixation  was  achieved  with  palatal-orbital  restraint  (Kopf), 
leaving  the  external  mead  unobstructed.  The  head  holder  was  attached  to  the  table  via 
magnetic  stand.  The  temporal  muscle  on  the  right  hemisphere  was  then  retracted  and  the 
later.a!  cortex  exposed  by  a  craniotomy  (approximately  6-8  mm  dorsoventra]  extent  and 
3-6  mm  rostrocaudal).  ITie  dura  overlying  the  middle  ectosylvian  gyrus  was  removed, 
tile  cortex  was  covered  with  silicone  oil,  and  a  photograph  of  the  surface  vasculature,  was 
taken  to  mark  the  electrode  penetration  sites.  For  maintaining  a  scmi-closed  system  with 
a  clear  view  of  surface  vasculature,  occasif'naiiy,  a  wire  mesh  was  placed  over  the 
craniotomy  and  the  space  between  the  grid  and  cortex  was  filled  with  a  1%  solution  of 
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clear  agarose.  ITie  ugaro.«‘*-filied  grid/chambcr  helped  to  Jimin»;-h  pulsation  of  the  cortex 
and  provided  a  fairh  unobsintctcf;  view  of  idenafiabk  loc  ,t.t  ns  across  the  expos'll 
cortical  suriacc. 

For  cat.<:  where  histology  was  performed,  at  the  end  of  the  cxf>cnmcnLs,  the  animal 
was  deeply  anesthetized  a.id  i>?dascd  uanscai dually  with  salrse  followed  by  formalin. 
Crcssyl-violci  staining  was  used  to  reconstruct  electrode  position  from  serial  frontal  50 
micron  sections 

Stimulw:  Generation  and  P:  livers' 

Experiments  were  coriducied  in  a  double- wallcxl  sound-shielded  room  (I AC). 
Auditory  stimuli  were  presented  via  calibrated  hcadphone.s  (STAX  54)  enclosed  in  small 
chambers  that  were  co.nnecmd  to  scund  delivery  tubes  sealed  irtto  the  acoustic  meaii 
[Sokolicn  1981,  U.S  Patent  42516861.  Tlie  sound  delivery  system  was  calibrate*,!  with  a 
sound  level  meter  (Bri'el  &  Kjacr  2209)  and  a  waveform  analyzer  (General  Radio  1521- 
B).  ITie  frequency  msponsc  of  the  system  was  essentially  flat  up  to  14  kHz  and  did  not 
have  major  resonance-s  deviating  more  than  -«•/-  6  dB  from  the  average  level.  Above  14 
kHz,  the  output  roiled  off  at  a  rate  of  10  dB/octavc.  Harmonic  distortion  was  at  least  55 
dB  below  the  primary  (depending  on  the  sampling  rate  and  the  settings  of  the  antialiasing 
low-pass  filler.) 

Tones  were  generated  by  a  microprocesror  (TMS32010;  16  bit  D/A  converter  at 
120  kH^:;  low-pass  filter  of  96  dB/octavc  at  15. 35  or  50  kHz).  Attenuation  was  provided 
by  a  pair  of  passive  attenuators  (Hewlett  Packard  330D).  The  duration  of  each  tone  burst 
was  usual!}'  50  ms,  except  when  it  was  extended  to  85  msec  for  iong-iatency  responses. 
Tire  rise/fall  time  was  3  ms.  The  interstimulus  interval  was  400  to  1000  ms. 
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Recording  Procedure 


Pr^iylcnr  coated  lungsten  microelectro  'CS  (hiicrop  obc  Inc.)  with  iinpe<3anres  of 
1 .0-8.5  MOhm  at  i  kHz  were  intioduce  i  into  the  auditory  cortex  with  a  hydraulic 
microtirivc  (KOPF)  rf'moiely  controlled  bv  a  stepping  tor.  All  penetr  ations  were 
r.  ughly  orthogonal  to  the  br.  in  surface.  The  recordings  reported  here  were  derived  at 
intracortical  depths  ra-iging  from  6(X)  to  1000  v’  n.  as  determined  by  the  mirrodrive 
settin;;.  Dimpling  was  usually  less  than  100  microns,  and  thus  nut  a  m^jor  problem.  In 
several  animals,  histology  indicatcft  that  recordings  were  from  layers  3  and  4.  Neuronal 
activity  of  single  units  o^  small  groups  of  neurons  (2-6  nemons)  v/ere  amplified,  band 
pass  filtered,  and  monitored  on  an  oscilloscope  and  an  audio  monitor.  Multiple  unit 
recordings  were  employed  only  to  map  the  sharpness  of  frequency  and  amplitude  tuning 
ac»-oss  AI.  Recording  multiple  units  allowed  for  coUection  of  enough  data  to  pre-mar  10 
to  30  cortical  locations  in  reasonable  amount  of  time  (less  than  eight  hours). 

Spikf  activity  was  isolated  from  die  background  noise  with  a  window 
discriminator  (BAK  DIS-1).  'Hie  number  of  spikes  per  presentation  and  tbie  anival  time 
of  the  first  spike  after  the  onset  of  the  stimulus  were  recorded  and  stored  (DEC  1 1/73). 
The  recording  window  had  a  duration  of  50  to  85  ms,  corresponding  tc  the  stimulus 
duration  a  nd  excluding  any  offset  respony,e. 

Data  Analysis 


[Fig.  2  sibout  here] 

From  tlie  responses  to  675  different  frequcncy/ievel  combinahoi;.?.  an  objectively 
determined  frequency  response  area  (Fig  2A)  was  construcle d  for  erery  rccuitiing  .site 
(For  more  detailed  explanation  of  FRA  procedure  see  Sutter  and  Schreiner  1991).  If  the 
resulting  FRA  was  not  well-defined  (subjectively  determined  as  no  responses  for  1/2  of 
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Uio  poTiti:  in  the  tuning  curve,  or  for  the  level  of  activity  encou.itcfed  in  AI,  roughly  the 
standard  deviation  oi  responses  greater  than  t  ie  r<  san),  the  j-rjcess  was  repeatec  with  the 
sujae  675  irdmuii  and  the  resuidng  evoked  activity  was  added  to  the  firit.  The  process 
was  repeated  (up  to  ove  times  for  some  multi-peake  l  s.nits)  until  a  well-defined  FRA 
was  obtained.  This  method  has  prrvi  ied  statistically  reliable  characterization  of  cells 
based  on  repeated  measures  controls  (Sutter  and  Schr.  inor  1991,  Tublc.  3',.  Response 
r  easures,  including  p  ak  firiug  rate,  were  calculateu  for  each  stimu'  as  condition  of  the 
I  RA  by  weighted  averaging  vith  the  eight  frequency/iniensity  neighbors  is  describ'id 
previously  (Sutter  and  Schreiner  1991 ). 

Spike  count  versus  i^vel  functions  were  derived  from  each  FRit  (Fig  2B).  The 
functions  were  reconstructed  by  adding  action  potentials  from  a  1/4  octave  bin  ceiitered 
around  the  unit's  CF  (usually  4  different  frequencies)  over  15  dB  (3  levels).  This 
provided  at  least  12  different  stimuli  for  each  tested  intensity  per  repetition  of  the  FRA 
procedure.  Oniy  units  which  were  tested  over  a  45  dB  range  or  more  above  minimum 
threshold  were  cou.sidercd  for  analysis  of  monotomcity.  Fiom  the  objectively  determined 
single  tone  FRAs  and  spike  count  versus  level  functions  several  response  properties  were 
measured. 

a)  Characteristic  frequency  (CF)  =  the  stimulus  frequency  with  the  lowest  sound 
pressure  level  necessary  to  evoke  neuronal  activity. 

b;  Minimum  threshold  =  lowest  intensity  associated  with  stimulus  evoked 
activity  in  the  frequency  response  area. 

c)  Bandwidth  (BWqojB)  =  bandwidth  (in  octaves)  of  pure  tones  to  which  a 
cell  responds  40  dB  above  minimum  threshold  (as  measured  from  the  frequency  tuning 
curve)  For  multi-peakcd  units  the  entire  bandwidth  encompassing  the  excitatory 
response  dotal  bandwidth)  was  measured. 

d)  Biuidwidtli  (BW  ^q)  =  the  bandwidth  (in  octaves)  of  pure  tones  to  which  a  cell 
responds  10  dLi  above  minimum  threshold  (as  measured  from  tlic  frequency  tuning 
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cui  v.r;.  Fot  multi  rjeaked  uni'S  the  entire  bandwidth  enc-jmpassing  die  excitatory 
response  (total  bandwidth)  was  measured. 

i)  bes:  rmplitude  =  The  stimulus  in‘easity  which  elicited  tiic  most  spikes  to  any 

tone. 

0  maximum  iiniig  rate  =  Tit :  number  of  spikes,  per  individual  sumulus 
presentation,  at  the  best  amplitude. 

g)  monotc  licity  ratio  =  The  number  of  spikes  e-iicited  at  the  highc.  t  inte/isity 
level  tested  divided  by  the  number  of  spikes  elicited  at  the  be,st  amplitude. 

[Figure  3  about  here] 

Single  neurons  or  multiple  unit  recordings  were  classified  as  memotonic  ii  their 
monotonicUy  ratio  was  greater  than  0.8  (Figure  3C).  Units  with  ratios  between  0.50  and 
0.80  ff'CK.  classified  oi  intCTncdiateiy  non-mcnotc  tic  (Figure  3B).  Monotomcity  ratios 
less  than  or  equal  to  0.5  were  considered  stfcngly  non-inonotonic  (Figure  3 A). 

Topographical  Classification  of  Single  Neurons 

[Flu.  4  about  here] 

Because  of  the  difficulty  involved  in  recording  enough  parametrically  fully 
characterized  single  neurons  to  construct  a  map  for  each  anirual,  inetiiods  for  pooling 
data  across  animals  were  employed.  Cytoarchitcciure,  vasculature  and  sulca’  patterns 
have  historically  not  been  reliable  landmarks  for  pooling  (Meizenich  et  al.  1975),  so  we 
used  physiological  landmarks  to  directly  compare  multiple  and  single  unit  topography  of 
monotonicity  ratio.  The  pooling  method  used,  tlrerefore,  depends  or  the  multiple  unit 
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infensity-tuniitg  tof  ogkaphy,  which  in  this  study  was  consistent  with  '.hose  of  Schrcnner  ei 


The  axis  for  the  gradients  of  sharpness  of  intensity  tuning  runs  roughly  in  the 
dorsnvcntral  dimer^sion,  oal'>ogona3  to  u»c  isofrequency  ?jds  (Fig.  1).  Two  regions  of 
sharp  intensity  tuning  (non-monctonic  reg'ons)  consistently  can  be  found  along  the 
dorsoventral  axis  using  the  multiple  unit  trchnique(Schrciner  et  al.  1992)  The  tw.  non¬ 
monotonic  regions  can  be  visualiit-id  as  two  minima  in  a  plot  of  monotonicity  ratio  versus 
dorsovcnti'al  location  (Fig.  4), 

To  pool  data  across  animals,  the  two  minima  in  the  monotnnicity  ratio  versus 
dorsoventral  location  plot  for  each  animal  were  used  as  noni):-ilization  points.  The 
raonotonicity  ratios  for  all  recorded  multiple  unit  clusters  for  case  SUTC16  arc  shown  in 
Fig.  4A.  initially,  tlie  most  dorsal  recording  ..'jitc  was  arbiuarily  assigned  a  value  of  0 
(Fig.  4A,  4B).  The  two  highly  non-monotonic  regions  wciu  approximately  1.0  mm  and  ) 

3.8  mm  from  the  most  dorsal  recording  site.  The  data  were  then  re-plotted  using  a  ! 

weighted  0.50  millimeter  .smoothing  algorithm  tFig.  4B,  see  legend).  (The  conventioii  of  \ 

placing  dashed  lines  at  the  two  minima  of  the  monotonicity  ratio  used  in  Fig.  4B  is  also  i 

used  in  other  figures.)  From  the  resulting  smoothed  curve,  the  positions  of  the  minima  in 
monotonicity  ratio  were  extrapolated  (c.g..  Fig.  4B  juid  Fig.  5  A,C). 

Because  the  ventral  and  dorsal  non-monotonic  regions  do  not  have  a  spabally 
exact  relationship  acro.ss  animals,  distance  needed  to  be  noimalized  to  use  tlicse  two 


lucaiions  as  pooling  laiiuinafks.  Tlie  uisiuucc  between  the  two  muiiiua  ranged  between 


2.8  and  3.4  millimeters  with  a  mean  of  3.1  (Miidian  3.05,  N=5),  and  therefore  wa.s 
normalized  to  "3.0"  millimeters  (transfonnation  from  Fig.  4B  to  4C),  approximating  the 
median  value  of  3.05.  For  tlie  case  depicted  in  Fig.  4,  normalization  distorted  distance  by 
about  7%.  This  distortion  can  be  seen  by  directly  comparing  Fig.  4B  and  Fig.  4C.  In  real 
distance  tlie  penetrations  from  this  animal  cover  slightly  less  than  5.0  millimeters  in 
dorsoventral  extent  (Fig.  4B);  however,  after  normalization  the  penetrations  occupy  more 
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tlfiji  5  millimeters  along  tht  iistoneo  normali2!xl  a-ds  (Fig.  4C).  Aftf  rnomialixauoi,  , 
the  dorsal  and  ventral  monotonicity  rat-n  minims  were  arbitradiy  assigned  locations  of 
0.0  and  3  0  millimeters  respectively. 

iFig,  S  about  here] 


The  method  of  pooling  data  across  animals  is  illustrated  in  Fig.  5.  The  un¬ 
normalized  spatial  disurihution:.  oi  single  neurons  are  shown  superimposed  on  ihe 
smoothed  multiple  unit  maps  (open  circles)  for  2  cats  in  panel?  A  and  C  (A  =  cat 
SirrC16,  C  =  cat  SUTC^2).  The  multiple  unit  maps  arc  normalized  as  previously 
described ,  and  the  locations  of  single  neurons  aie  assigned  abscissa  values  from  "he 
normalized  coordinate  system  (Fig.  5B  =  SUTC16  and  Fig.  5D  =  SUTC12).  For  case 
sure  16  (Fig.  5  A  and  C)  single  neurons'  monotonicity  ratios  are  represented  by  crosses 
ard  for  case  RUTC12  (Fig.  5B  and  D)  by  diamonds.  After  nonnaiization,  single  units  for 
both  cases  have  been  assigned  values  relative  to  the  monoionicity  m?  p  and  avc  directly 
comparable  voompare  crosses  and  diamonds  in  Fig.  5E  to  tlrose  of  5D  and  5B).  The  same 
pooling  process,  as  demonstrated  in  Fig.  5  for  cats  SUTC12  and  SUTC16,  was  repeated 
for  all  other  cases. 

To  enable  comparisons  of  intensity-dependent  properties  to  previous  studies,  the 
integrated  excitatoi'y  'oandwidth  of  multiple  unit  recordings  (Schreiner  and  Mendelson 
1990;  Schreiner  and  Sutter  1592)  was  also  used  as  a  pooling  landmark.  For  pooling 


relative  to  bandwidth,  3  measures  have  been  used.  The  dorsovcntral  location  where  the 
multiple  unit  responses  have  the  sharpest  tuning  40  dB  above  threshold  (BW40inin).  die 
dorsovcntral  location  where  the  multiple  unit  responses  have  the  sharpest  tuning  10  dB 
above  threshold  (B  W^Q^iin),  jjid  a  composite  of  the  two  (BW  io/40min)’  die 

average  loca'uon  of  the  BW^Omin  BW4Qj^jjj 


10 


Sutter  :uui  Schreiner:  Single  Unit  Intensity  Topography 


J659-3 


RESULTS 

Results  are  based  on  recordings  oi  i08  single  neurons  recorded  from  9  cats. 
Single  neuron  recordings  were  topographically  localized  with  respect  to  known  muluple 
unit  maps.  Single  neuron  population  results  were  compared  to  147  multiple  units  that 
were  recorded  ir  the  initial  mapping  procedure  from  the  same  set  of  experiments. 

We  .found  neurons  with  a  vi'idc  rangf  of  monotonicity  ratios  (Fig.  3).  Throughout 
AI,  single  neuron  recordings  yielded  a  higher  percentage  of  non-monotonic  responses 
than  did  multiple  unit  recordings.  Single  neurons,  particularly  those  in  ventral  AI,  tended 
to  have  a  steeper  reduction  of  activity  at  high  intensity  levels  (i.e.,  smaller  monotonicity 
ratios)  dian  was  observed  for  multiple  unit  responses.  While  the  multiple  unit  topography 
had  two  clear  non-  monotonic  regions,  the.  dorsal  non-monotonic  region  was  difficult  to 
reconstruct  from  single  unit  data.  Furtiiemtorc,  we  saw  more  local  topographical  scatter 
in  monotcnicity  ratio  and  minimum  titreshold  for  single  neurons  than  for  multiple  units. 

Difierences  between  single  and  multiple  unit  monotonicity  ratio  topographies  can 
be  accounted  for  by  two  sep;irate  results;  (1)  a  greater  scatter  in  intensity  thresholds  in 
monolonic  regions.  The  non-monotonic  regions,  with  Jov'er  threshold  scatter,  contain 
neurons  with  siinilai’  best  amplitudes.  (2)  Non-monotonic  neurons  contiibute  more  spikes 
in  topographically  non-moaoionic  regions  than  in  lopugraphically  monotonic  regions. 

Within-Experiment  Monotoniciry  Topography 

As  reported  in  an  carliei  paper  (Schreiner  ei  al.  1992),  a  dorsal  and  a  ventral  non¬ 
monotonic  region  were  observed  in  AI.  In  some  cases  (4/9),  a  gradient  with  increasing 
sharpness  ol  frequency  tuning  towards  the  ventral  exueme  of  AI  was  found.  In  these 
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animals,  it  appeared  That  this  gradient  was  leading  to  a  3rd  non  monotonic  region  in  or 
near  AH. 

While  the  intensity  dependent  response  properties  of  single  neurons  and  multiple 
units  were  different  in  some  respects,  the  locations  of  the  major  minima  and  maxima  of 
the  monotoaicity  ratios  of  single  and  multiple  unit  responses  corresponded.  Single 
neurons  on  average  had  lower  monotonicity  ratios,  and  much  wider  range  of 
monotonicity  ratios  at  a  given  dorsoventral  location  than  did  multiple  units  (e.g.,  Fig.  5). 
The  single  neuron  topography,  was  qualitatively  similar  to  the  multiple  unit  topography 
for  the  two  animals  with  ihc  larj  ,st  single  unit  samples,  SUTC12  and  SUTC16  (Fig.  5), 
While  similarities  between  single  and  multiple  units  can  be  seen,  the  significance  is  hard 
to  determine  because  of  the  large  scatter  in  values  for  a  given  dorsoventral  location,  the 
small  number  of  samples,  and  the  sharper  tuning  of  single  units.  Therefore  pooling  data 
across  animals  was  necessary  for  quantitatively  verifying  the  observed  single  neuron 
contributions. 

Pooled  Monotonicity  Data 


[Fig.  6  about  hem] 

For  multiple  unit  recordings,  pooling  did  not  distort  the  topographical  distribution 
of  monotonicity  l  atio  which  had  a  .similar  .shape  to  that  described  for  individual  cases.  A 
weak  non-monotonic  region  ne-^r  the  dorsal  non-moriotonic  reference  and  a  stronger  non¬ 
monotonic  region  about  3  mm  ventral  were  observed  (Fig.  6A).  A  Ibiid  nou-monotonic 
region  can  be  seen  at  the  ventral  extreme  of  AI  (from  5-6  mm)  for  both  m'iltiple  and 
singl':  unit  topographies.  This  is  caused  by  the  ccca’  ionally  encountered  non-monotonic 
region  near  the  AL'AII  border. 
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For  the  pooled  data,  differences  in  the  details  of  multiple  and  single  unit  response 
properties  can  be  seen  throughout  the  dorsovcntral  extent  of  AI,  but  in  ventral  A1  the 
general  structure  of  multiple  and  single  unit  topographies  were  similar.  Single  units 
responded  with  a  wider  range  of  monotonicity  ratios  than  multiple  units  throughout  the 
entire  dorsoventral  extent  of  Al  (Fig.  6A.B).  Single  units,  also,  on  average  were  more 
sharply  tuned  for  intensity  than  multiple  units.  While  the  wider  variability  and  sharper 
intensity  tuning  of  single  units  is  apparent,  in  ventral  AI  there  is  a  correspondence  of  the 
topographical  gradients  (Fig.  6C).  A  minimum  in  monotonicity  ratio  is  located  in  the 
ventral  non-monoionic  region  as  determined  by  multiple  unit  mapping,  'fhis  minima  is 
accompanied  by  an  ascending  gradient  in  monotonicity  ratio  for  approximately  1.5  mm 
in  either  direction  for  botli  multiple  and  single  units.  In  dorsal  AI,  however  there  is  no 
correspondence  of  the  pooled  topographies.  The  multiple  unit  gradient  of  monotonicity 
ratio  starts  descending  from  a  maxima  at  about  2  millimeters  from  the  dorsal  non¬ 
monotonic  region,  then  reaches  a  minima  at  the  center  of  the  dorsal  non-monotonic 
region  and  reverses  (Fig.  6A.<^).  The  topography  of  single  unit  monotonicity  ratio, 
however,  is  relatively  flat,  except  at  the  dorsal  extreme  which  only  has  a  sample  of  4 
cells  (Fig,  6B).  The  difference  between  single  and  multiple  units  in  dorsal  AI  indicate 
that  local  differences  in  single  unit  scatter  or  the  spike  contribution  of  sampled  cells  are 
generating  the  observed  multiple  unit  gradients.  The  topographical  variation  of 
correspondence  between  single  and  multiple  unit  responses  demonstrate  that  there  are 
physiological  and/or  organizational  differences  between  dorsal  and  ventral  non¬ 
monotonic  regions. 

One  might  ask  whether  regions  of  low  monotonicity  ratio  are  a  result  of  a  few 
strongly  non-monotonic  cells,  or  of  many  intermediately  non-monotonic  cells.  From  Fig. 
6A,B  it  appears  that  dorsal  AI  has  fevrer  strongly  non-monotonic  neurons  than  ventral 
Al,  but  a  similar  proportion  of  intermediately  non-monotonic  cells.  However  it  is 
difficult  to  determine  percentages  from  this  figure  because  there  are  many  cells  with  a 
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monoionicity  ratio  of  1  with  overlapping  symbols.  To  address  this  question 
quai>.iitatively,  we  classified  neurons  as  monotonic  (monctonicity  ratio  >  0.8),  strongly 
noD-monotonic  (monotonicity  ratio  <0.5),  or  intermediately  non-monotonic 
(monotonicity  ratio  between  0.5  and  0.8).  A  criterion  of  0.8  as  the  monotonic/non- 
monoionic  cutoff  is  somewhat  arbitrary,  and  was  chosen  to  be  conservative  in  classifying 
a  unit  as  "monotonic". 


[Fig.  7  about  here] 

The  percentage  of  non-monotonic  multiple  units  plotted  versus  pooled  location 
shows  the  spatial  distribution  that  would  be  expected  from  the  monoionicity  ratio 
topography  (Fig.  7A).  Namely,  locations  with  more  than  50%  nou-monotonic  neurons 
are  aligned  with  the  locations  that  have  the  lowest  monotonicity  ratios.  There  were 
however,  only  a  few  suongly  non-mcnotonic  multiple  units.  No  spatial  bin  contained 
more  than  25%  of  strongly  uon-monotonic  multiple  units.  The  small  number  of  strongly 
non-monotonic  ttiulUple  unit  responses  roughly  inversely  followed  the  spatial  distribution 
of  monoionicity  ratio.  The  percentage  of  intermediately  non-monotonic  multiple  unit 
responses  show  a  more  pronounced  spatial  distribution  very  similar  to  the  expectations 
from  the  maps  shown  in  Fig.  6C.  These  results  indicate  that  the  spatial  distribution  of 
multiple  unit  monotonicity  ratio  is  a  result  of  an  increase  in  the  percentage  of  non¬ 
monotonic  multiple  unit  responses  at  certain  locations. 

For  single  neurons,  a  similar  straight-forward  explanation  of  the  spatial 
distribution  of  monoionicity  did  not  apply  to  all  cortical  locations.  While  the  distribution 
of  the  percentage  of  non-monotonic  single  neurons  followed  the  topographical 
distribution  for  inonotonicity  raiio  in  venual  AI,  tltis  was  not  as  clearly  expressed  for  the 
dorsal  third  of  AI  (Fig.  6C  and  Fig.  7B).  Although  the  percentage  of  non-monotonic 
neurons  in  dorsal  AI  were  similar  to  the  central  and  ventral  regioii  (>60%),  there  was  a 


14 


Sutter  and  Schreiner:  Single  Unit  Intensity  Topography 


J659-3 


lack  of  strongly  non-mono  tonic  neurons  (<10%)  in  dorsal  AI.  The  high  percentage  of 
intermediately  non-monotonic  neurons  in  dorsal  AI  is  another  property  which 
physiologically  distinguishes  it  from  ventral  AI.  The  differences  in  the  sharpness  of 
intensity  tuning  between  neurons  of  dorsal  and  ventral  non-monotonic  regions  contribute 
to  tire  apparent  topographical  differences  between  single  and  multiple  unit  maps. 

Contribution  of  Firing  Rate  and  IJtreshold 

There  are  at  least  two  additional  contributing  factors  to  hov/  a  multiple  unit 
monotonicity  topography  can  be  created  in  dorsal  AI,  while  a  .similar-  topography  is  less 
evident  in  the  monotonicity  of  single  neurons.  One  is  that  non-rrionotonic  neurons 
contribute  more  spikes  to  cluster  responses  in  non-monotonic  regions  than  do  monotonic 
neurons.  The  other  is  that,  thresholds  of  neurons  might  be  similar  in  non-monotonic 
regions.  The  resulting  frequency  tuning  curves  of  non-monotonic  single  neurons  in  non¬ 
monotonic  regions,  tlius,  would  be  superimposed  on  each  other  creating  a  multiple  unit 
response  which  is  tuned  for  intensity.  By  contrast,  the  threshold  scatter  m  the  mouotonic 
regions  would  obscure  the  intensity  tuning  of  individual  neurons  when  looking  at 
multiple  unit  responses. 


[Fig.  8  about  here] 

Pail  of  tlie  diffeience  between  multiple  and  single  units  in  dorsal  AI  is  due  to 
non-monotonic  neurons  contributing  more  spikes  than  monotonic  ceils.  For  every 
neuron,  the  firing  rate  at  the  best  amplitude,  (FRmaJt)  '"'as  calculated.  Within  each  0.5  or 
1.0  mra  bin,  this  firing  rate  was  added  for  all  non-monotonic  single  neurons  (PPunax 
nuii-monoi-  FRtttax  was  also  added  for  all  monotonic  neurons  in  each  bin  to  arrive  at 
FRmax  n^ono-  FRmax  mono  divided  by  FKmax  non-mono  arrive  at  the  spike 
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ratio  for  any  given  location  (Fig.  8).  Ratios  greater  than  1  indicate  that  non-mcnotonic 
neurons,  on  average,  contributed  more  spikes,  at  best  intensity,  than  monotonic  neurons. 
The  patial  distribution  of  spike  ratios  parallels  that  of  multiple  unit  monotonicity  ratios 
in  the  dorsal  non-monotonic  region.  Notice  that  between  0.0  and  2.0  millimeters  the 
spatial  distribution  of  single  unit  spike  ratio  is  different  than  the  liiat  of  single  unit 
monotonicity  ratio  (Fig.  6).  The  single  unit  spike  ratio  shows  a  gradient  consistent  with 
the  multiple  unit  topography  of  monotonicity  ratio  (Fig.  6  and  Fig.  8),  while  single  unit 
monotonicity  ratio  map  is  flat.  Differential  conuibution  of  spikes  between  monotonic  and 
non-monoiomo  ncuron.«,  therefore,  at  least  partially  contribute  to  the  formation  of  a 
moitiple  unit  topographic  representation  of  monotonicity  from  a  reladvelv  non- 
lopogtaphic  single  neuron  distribution  in  dorsal  Al. 


[Figure  9  about  he;  e] 


Changes  in  threshold  scatter  also  contribute  to  the  creation  of  "raonotonic 
regions"  from  underlying  non-monotonic  cells.  Single  neuron  thrc.sholds  were  less 
scatieied  in  the  multiple  unit  non-monotonic  regions  than  they  were  in  the  mapped 
monotonic  regions.  This  was  apparent  in  individual  cases  as  well  as  in  the  pooled  data 
(Fig.  9).  Notice  that  in  the  non-monotonic  region  near  the  AI/AII  border  the  scatter  wa.s 
large,  probably  reflecting  the  increased  neuron  thresholds  observed  as  the  AI/AII  border 
is  approached  (Schreiner  and  Cynader  1984).  Thresholds  in  tiae  dorsal  and  ventral  non¬ 
monotonic  regions  of  AI  were  among  the  lowest  thresholds  recorded  in  each  animal  (Fig. 
9B).  The  superposition  of  responses  from  non-monotouic  neurons  with  low  thresholds 
contnbutes  to  the  creation  of  regions  of  non-monoionic  multiple  unit  responses. 
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Relation  of  Mono  tonicity  Topography  to  Sharpness  of  Tuning  (BV,’)  Topography 


[Figures  10  and  11  about  here] 


The  monotonicity  topography  is  related  to  the  sharpness  of  frequency  tuning 
topography  reported  in  previous  studies  (Schreiner  and  Mcndclson  1990,  Sutter  and 
Schreiner  1991;  Schreiner  and  Sutter  1992).  For  pooling  data  relative  to  the  most  sharply 
tuned  "center"  of  AI  (such  as  in  Sutter  and  Schreiner  1991;  Schreiner  and  Sutter  1992), 
the  measure  of  the  point  of  alignment  chosen  is  critical.  When  pooled  relative  to  the 
location  of  sharpest  frequency  tuning  as  determined  by  BW4o^  (i.e.,  the  bandwidth  of 
pure-tones  to  which  the  netiron  responds  40  dB  above  the  neuron's  minimum  threshold), 
the  monotonicity  topography  is  similar  to  that  observed  in  individual  cases.  HoA'ever, 
when  the  data  were  pooled  reladve  to  a  measure  that  iircorporates  the  BWjodB 
distribution  the  pooled  topography  was  degraded. 

Relative  to  the  minima  in  the  BW40(j3  map,  the  dorsoventral  properties  of 
monotonicity  ratio  remain  similar  (Fig.  lOA  and  Figs.  1 1  A,  B).  The  ventral  minima  is 
located  from  0.0  to  0.5  mm  ventral  of  the  sharply  tuned  BW40JB  region  and  the  dorsal 
non-monotonic  region  is  2  to  3  ram  dorsal  to  it.  As  with  the  monotonicity  normalization, 
tlie  shape  of  the  mean  monotonicity  rctio  vs.  location  plot  is  similar'  in  ventral  AI,  but 
quite  different  in  dorsal  .AI  (Fig.  lOA).  However,  when  the  data  is  pooled  by  a  measure 
that  includes  BWjodB’  i-®-*  bandwidth  of  pure  tones  to  which  the  neuron  responds  10 
dB  above  threshold,  the  dorsoventral  delineation  for  amplitude  properties  degrades  (Fig. 
lOB).  This  is  consistent  with  the  interpretation  that  non-monotonicity  topography  is  more 
dependent  on  high  intensity  than  near  threshold  properties  of  sampled  neurons. 
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DISCUSSION 


A  series  of  experiments  were  performed  to  investigate  the  topographical 
distribudon  of  response  parameters  that  contribute  to  the  intensity  selectivity  of  single 
neurons  in  cat  AI.  Consistent  with  an  earlier  study  (Schreiner  et  al.  1992),  two  multiple 
unit  non-monoionic  regions  spaced  about  3  millimeters  apart  could  usually  be  identified. 
The  ventral  non-monotonic  region  was  consistently  within  a  millimeter  of  BW40inin* 
location  of  multiple  units  most  sh.^^irply  tuned  for  stimulus  frequency  40  dB  above 
threshold.  Strongly  ncn-monotonic  neurons  were  concentrated  in  this  ventral  non¬ 
monotonic  region.  In  dorsal  AI,  there  were  many  neurons  whose  firing  rates  were 
reduced  at  high  intensities.  The  magnitude  of  this  reduction  on  average  was  smaller  tlian 
those  in  the  ventral  non-monotonic  region. 

While  the  ventral  non-monotonic  gradient  could  be  clearly  identified  by 
topographically  pooling  the  mean  monomnicity  ratios  for  sampled  single  units,  the  dorsal 
gradient  could  not  be  detected  in  the  pooled  single  unit  response  data.  Differences 
between  single  unit  and  multiple  unit  representational  topographies  in  dorsal  AI  could  be 
accounted  for  by  topographical  differences  in  the  local  threshold  scatter  of  contributing 
neurons,  and  by  the  spike  contributions  (response  strength)  of  non-  monotonic  single 
neurons. 

The  topographical  distribution  of  multiple  unit  maps  was  not  lost  by  the 


employed  pooling  meiliods.  While  pooled  monotonicity  maps  were  sul’ficiently  preserved 


using  BW40niin  —  ^  pooling  landmark,  they  were  substantially  degraded  when  other 
landmarks,  such  as  BW  lOmin  were  included. 
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Methodological  Implications 

The  method  of  determining  the  degree  of  monotonicity  in  this  paper  is 
substantially  different  from  the  methods  applied  by  others  (e.g.,  Phillips  el  al.  1985: 
Phillips  1985).  In  eai/lier  work,  rate-level  functions  were  collccte.d  with  50  repetition 
peri-time  stimulus  histograms  PSTHs.  Maximal  firing  rales  were  often  below  1  spike  per 
presentation  since  AI  ceils  strongly  habituate  to  repeated  presentation  of  the  same  tone. 

By  collecting  data  for  FRA's  with  a  pseudo-random  order  of  675  different 
frequcricy/level  combinations,  we  substantialiy  reduced  habituation  effects  because  the 
average  time  between  similar  stimuli  was  longer.  An  example  of  reduced  habituation  is 
shown  in  Fig.  12.  'i  he  response  measured  over  a  0.1  octave  band  is  comparable  to  the 
response  to  5  presentations  of  CF  tones  (Fig.  12E,  inter-stimulus  interval  =  750  msec). 
The  5  preseiitauons  of  CF  tones  vverc  die  first  5  from  a  stimulation  paradigm  of  50. 

When  all  50  repetitions  were  used  for  analysis,  the  average  response  per  stimulus 
presentadon  w!is  approximately  half  as  strong  (Fig.  12),  demonstrating  that  the  cell  was 
at  least  pardally  habituated  for  the  last  45  sdmulus  presentadons. 

[Fig.  12  about  here] 

Reduced  habiluadon  introduced  by  our  method  is  uaded  off  with  problems 
introduced  by  adding  responses  to  tones  within  a  1/4  octave  of  CF.  By  adding  responses 
across  more  than  1  frequency,  we  may  have  overesdmated  the  percentage  of 
intermediately  non-monotonic  neurons.  Phillips  and  colleagues  (1985)  repoited  that 
almost  all  non-monotonic  neurons  were  completely  inhibited  at  high  intensides.  The  high 
proportion  of  intermediate  non-monolonic  neurons  encountered  in  this  study  might  be 
partly  accounted  for  by  methodological  differences.  Neurons  that  do  not  respond  to  high 
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intensity  CF  tones  might  still  respond  to  tones  whose  frequencies  arc  within  a  quarter 
octave  of  CF.  The  neuron  whose  FRA  is  depicted  in  Fig,  2  and  whose  spike  count  versus 
level  functions  are  displayed  in  Fig.  12  exemplifies  such  an  "oblique"  frequency  tuning 
curve.  The  neuron  s  firing  rate  falls  off  rapidly  at  the  CF,  such  that  there  are  no  CF 
responses  above  45  dB  SPL;  however,  at  intensities  greater  than  45  dB,  there  were  still 
responses  at  frequencies  below  the  CF.  The  low-frequency  response  causes  the  FRA- 
determined  spike  count  versus  level  profile;  ‘.o  fall  off  less  steeply  than  the  PSTH- 
determined  spike  versus  level  profile  (Fig.  12A).  Non-monotonic  neurons  with  oblique  or 
tilted  frequency  tuning  curves  have  been  previously  shown  (e.g.,  Fig.  3  of  Phillips  ct  al. 
1985).  Many  of  the  intermediate  non-monotonic  neurons  identified  in  this  study  had 
oblique  frequency  tuning  curve.s.  There  is  no  evidence  that  tlie  intermediate  non¬ 
monotonic  neurons,  as  defined  in  this  study,  had  spike  versus  level  functions  asymptoting 
at  intermediate  response  rates.  Instead,  almost  all  intermediate  non-monotonic  neurons 
showed  a  downward  slope  of  their  spike  versus  level  functions  at  the  highest  tested 
intensities  (e.g,,  Fig  3).  Since  the  intermediately  non-monotonic  neurons  had  not  reached 
an  asymptote  or  zero  response  at  the  highest  intensity  tested,  we  can  neither  confirm  nor 
rule  out  the  existence  of  non-monotonic  neurons  that  asymptote  at  intermediate 
intensities. 


Another  methodological  consideration  is  the  recording  depths  encountered  in  this 


study.  While  histology  has  couiinucd  that  iccordings  were  restricted  to  layers  3  and 


A 

*T  Ali 


several  animals,  we  cannot  rule  out  the  possibility  that  some  recordings  extended  beyond 
these  boundaries  in  animals  for  which  no  histology  was  performed.  This  should  not  pose 
a  problem  because  the  number  of  such  recordings  would  have  to  be  minimal  and,  thus, 
would  be  averaged  out  by  pooling.  This  study  did  not  investigate  the  effects  of  depth  on 
single  unit  topography,  but  rather  attempted  to  sample  from  a  slab  corresponding  to 
layers  3  and  4.  White  multiple  unit  maps  of  intensity  are  stable  with  recording  depth 
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(Schreiner  et  al.  1992),  single  unit  topographies  need  not  be.  Therefore,  the  effect  of 
recording  depth  is  .sull  an  open  question. 

Inter-animal  variability 

Pecent  multiple  unit  mapping  studies  (Schreiner  ct  al.  1992;  Heil  ei  al.  1992b) 
have  reporteti  reproducible  topographical  distributions  of  bandwidth  and  intensity  in  cat 
Al.  Although,  general  features  of  the  topography  arc  highly  reproducible,  tltere  is  a  high 
degree  of  intcr-aniiiial  variability  it-  the  details  of  the  topographic.^.  The  details  of  inter- 
aninial  variability  are  lost  in  the  pooling  process.  Pooling  data  jrcross  animals,  however, 
allows  us  to  compare  the  ino.st  reliable  and  reproducible  aspects  of  the  topographies 
across  animals  for  single  and  multiple  units.  In  these  studies,  the  n'cliable  topographical 
landmarks  are  a  region  of  multiple  unit  respomes  sharply  tuned  for  frequency  and  two 
regions  of  multiple  unit  responses  sharply  tuned  for  intensity  (Schreiner  and  Mendclson 
1990,  Schreiner  et  al.  1992;  Heil  et  al.  1992b).  Our  ability  to  obtain  pooled  results  based 
on  these  topographical  properties  is  a  testament  to  the  reliability  of  these  aspects  of  tl»e 
multiple  unit  topography  from  animal  to  animal.  Our  method,  however,  ignores  the  inter¬ 
animal  variability  described  in  these  topographies.  Tlicrefore,  it  is  important  to  remember 
that  Uie  goal  of  Utese  experiments  was  to  compare  multiple  and  single  unit  responses.  In 
single  unit  studies,  one  cannot  obtain  a  large  enough  extent  and  fine  resolution  to 
measure  idiosyncratic  inter-animal  variability  over  completely  mapped  areas  (which 
requires  961  locations  to  map  out  a  6  mm  by  6  mm  grid  with  200  micron  resolution). 
Additionally,  pooling  single  and  multiple  unit  data  within  the  same  animal  will  yield 
irregular  maps  because  of  differences  in  single  unit  and  multiple  unit  topographies 
(Schreiner  and  Sutter  1992).  Comparing  single  unit  and  multiple  tmil  topographies  allow 
u.s  to  gain  a  better  understanding  of  the  basic  features  of  single  unit  topography 
underlying  Uic  observed  multiple  unit  properties. 
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An  argument  can  be  put  forth  that  our  not  finding  a  pooled  monctonicity  ratio 
gradient  for  single  units  in  dorsal  AI  is  due  to  inter-anijnal  variability.  It  is  possible  that 
within  each  animal  there  is  a  weak  single  unit  gradient  for  monotonicity  ratio.  However, 
there  arc  several  arguments  against  pooling  being  the  sole  reason  for  the  absence  of  a 
gradient  of  pooled  single  unit  monotonicity  ratio  in  dorsal  Al.  First,  we  do  see  a  gradient 
for  pooled  multiple  unit  maps  of  intensity  parameters  in  dorsal  A?..  Therefore,  if  inter¬ 
animal  variability  is  the  cause  of  die  lack  of  a  gradient,  the  variability  has  to  be 
constrained  to  single  units  while  not  being  present  in  multiple  units.  Second,  we  see 
spatial  gradients  ol  .spike  ratio  and  threshold  for  single  units  in  dorsal  AI.  'inese  results 
indicate  that  the  lack  of  spatial  gradient  or  very  weak  spatial  gradient  of  monotonicity 
ratio  in  dorsal  AI  is  not  solely  a  result  of  pooling  data  across  ajiimals. 

Comparinon  to  Previous  Studies 

The  percentage  of  non-monotonic  neurons  reported  in  this  paper  are  consistent 
witli  previous  results  in  cat  AI.  Recent  studies  from  Uic  cat  have  reported  approximately 
half  of  AI  cells  to  be  non-monoionic.  When  comparing  neuron  populations  across 
investigators,  the  possibility  of  topographical  sampling  biases  (particulaily  towards 
central  and  ventral  AI  in  recent  reports)  must  be  kept  in  mind  (Sutter  and  Schreiner 
1991).  Recording  from  similar  cortical  dcpUis,  Phillips  et  t'l.(1985)  found  that  44%  of 
single  neurons  (N?=61)  are  strongly  non-monotonic,  13%  intennediately  non-monotonic 
and  43%  monotonic.  In  a  preliminary  di.sclosurc,  Barone  and  colleagues  (1990)  have 
reported  that  40%  (data  base  of  333  neurons)  of  recorded  AI  neurons  were  strongly  non¬ 
monotonic.  In  all  regions  of  AI,  the  reported  percentage  of  uon-monotonic  neurons  in 
this  paper  is  similar  to  tliose  of  previous  studies.  In  the  two  millimeters  of  the 
dorsoventral  extent  of  AI  centered  on  BWiQ/4Qjjjjn'  33%  of  neurons  arc  strongly  non¬ 
monotonic,  28%  arc  intermediately  non-monotonic,  and  39%  monotonic.  Dorsal  to  this 
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region  10%  arc  strongly  non-monotor-ic  and  38%  are  intermediately  non-moj'otonic. 
Ventral  to  this  region,  35%  are  strongly  non-monotonic  and  19%  are  intermediately  non¬ 
monotonic.  These  results  are  consistent  with  previous  findings  tliat  roughly  40-60%  of 
AI  neurons  are  non-monotonic  (sec  also  Figs.  7  and  11).  The  results  from  the  central  and 
ventral  region  arc  consistent  with  the  findings  tliat  most  non-monotonic  neurons  in 
'classic:!!'  AI  are  strongly  non-monotonic  (Phillips  et  al.  1985). 

The  percentage  of  non-monotonic  neurons  is  substantially  different  between  AI  of 
the  cat  and  AI  of  Uie  mustached  bat.  Most  of  the  neurons  in  bat  AI  are  intensity  selective, 
although  only  25  of  153  (16%)  were  completely  inhibited  at  high  intensities  (Suga  and 
Manabc  1982).  Of  540  neurons  in  the  same  study,  only  19  were  monotonic.  If  we  assume 
tlien  dial  at  most  19  of  153  neurons  were  monotonic,  at  least  87%  of  bat  Al  neurons  were 
non-monotonic,  and  at  least  71%  of  AI  neurons  were  intermediately  non-monotonic.  The 
results  of  Suga  and  Manabe  indicate  that  the.  percentages  and  strengths  of  non-monotonic 
neurons  in  AI  can  vary  substantially  across  .species. 

lliis  interpretation,  though,  is  s>abjcct  to  a  severe  limitation  due  to  differences 
between  free- field  and  dichotic  stimulus  presentation  methods.  In  the  bat  study,  auditory 
stimuli  are  presented  frce-ficid,  while  in  the  cal,  stimuli  were  presented  monaurally.  Hie 
percentage  of  non-monotonic  units  found  in  the  medial  geniculate  body  (MGB)  and 
inferior  colliculus  (IC)  indicate  that  higher  percentages  of  non-monotonic  neurons  arc 
reported  in  experiments  performed  under  frec-field  conditions.  In  free-field  studies  of  the 
cat,  80%o  (Aitken  1991)  of  IC  neurons  were  non-monotonic.  However,  recording  from 
MGB  under  monaural  conditions  there  is  a  progression  in  tlie  percentage  of  non¬ 
monotonic  neurons  from  3U%  in  the  anterior  portion  to  60%  in  tlie  posterior  portion  of 
MGB  of  the  cat  (Rodrigucs-Dagaeff  ct  al.  1989).  Under  monaural  stimulation  conditions 
in  the  guinea  pig,  24%  of  IC  neurons  were  non-monotonic  (Rees  and  Palmer  1988). 
Across  tlicse  experiments  neuronal  recordings  perf  ormed  under  free-field  conditions 
consistently  result  in  a  larger  percentage  of  non-monotonic  cells  than  dichotic 
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coDtralaterai  stimulation.  One  might  expect  to  see  higher  percentages  of  nori-monotonic 
neurons  under  free- field  conditions  because  of  neurons  suppressed  by  the  ipsilateral  ear 
witir  higher  thresholds  than  the  excitation  threshold  in  the  contralateral  ear.  Confounding 
the.  results  further  arc  the  different  manners  in  which  investigators  define  the  boundaries 
of  brain  regions.  The  relatively  few  studies  of  amplitude  processing  in  the  central 
nervous  system,  coupled  witli  a  large  variation  m  methodologies  used,  make  it  difficult  to 
conclude  whether  there  arc  species  dependent  or  brain  region  dependent  differences  in 
tlic  percentages  of  uon-monotonic  neurons. 

While  the  percentage  of  non  monotonic  neurons  may  be  different  between  cat  and 
mustached  bat,  topograpnied  representations  of  stimulus  intensity  can  be  obser/cd  in 
both  species.  Besides  the  cat,  functional  maps  of  amplitude  related  parameters  in  AJ  only 
have  been  demonstrated  in  bats.  In  the  mustached  bat  tljerc  is  a  map  of  best  amplitude 
that  runs  roughly  orthogonal  to  the  frequency  map  (Suga  ’977;  wSuga  and  Mauabe  1982). 
For  echolocation,  the  amplitude  of  the  returned  pulse  conveys  information  about  object 
size.  The  findings  of  topography  of  mcmotonicity  and  best  level  in  the  cat  (Schreiner  ct 
a!.  1992)  combined  witii  the  single  neuron  results  presented  in  this  study  indicate  that  a 
topographical  representation  of  intensity  parameters  might  be  a  general  mammalian 
auditory  cortical  property.  For  both  species,  the  junplitudc  of  the  stimulus  also  appeal's  to 
be  encoded  orthogonal  to  the  tonotopic  axis  in  AI.  While  Suga  and  Manabe  did  not  study 
the  sharpness  of  amplitude  tuning  (they  studied  best-level),  there  is  some  indication  that 
there  is  an  orderly  organization  of  degre:;  of  monotonicity  (see  Fig.  10  from  Suga  and 
Manabe  1982). 

Possible  Mechanisms  Creating  Non-monotonicity 

An  aigumcnt  can  be  put  forth  that  neuronal  intensity-tuning  is  solely  a  result  of  the 
broader  frequency  response  of  the  basilar  membrane  at  higher  intensities  (B6k6sy  1960; 
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Rhode  1971;  Ruggero  and  Rich  1991)  impinging  on  inhibitory  sidebands,  'The  broadening 
of  tuning  is  due  to  the  mechanical  properties  of  the  basilar  membrane  and  effects  of 
stimulus  shaping.  ITierefoie,  under  these  circumstances  lateral  inhibition  is  converted  to 
intensity  tuning.  If  a  splattering  of  energy  into  inhibitory  sidebands  is  contributing  to  the 
creation  of  intensity-selective  neurons,  one  would  expect  to  find  inhibitory  sidebands 
approximately  40  dB  below  the  highest-intensity  at  which  the  cell  is  excited.  However, 
low  excitatory  thresholds,  .such  as  tliosc  reported  for  cells  in  the  non-monotonic  areiis,  arc 
unrelated  to  this  potential  mechanism.  Phillips  (1988)  has  provided  evidence  that  spectral 
splatter  from  inhibitory  sidebands  contributes  to  intensity  tuning  by  demonstrating  that  for 
some  non-monotoiiic  cells  intensity-tuning  becomes  sharper  witli  more  rapid  tone  onsets. 
Based  on  non-inonotonic  cells  responses  to  broad-band  noise  Phillips  and  Cynader  (1985) 
liave  also  hypotliesizcd  that  non-monotonicity  might  arise  irom  inhibitory  neurons  with 
similar  CFs  but  higher  thresholds.  We  have  recently  collected  data  demonsuating  non¬ 
sideband  high  threshold,  CF  inhibition  contributes  to  intcitsity  tuning  (personal 
obseivation).  Tlic.se  results  demonstrate  tliat  the  auditory  sy.stem  can  use  .sideband  (lateral) 
and/or  non-sideband  inhibition  to  create  intensity  tuning. 

While  it  is  probable  tl^at  sideband  inhibition  contributes  to  intensity-tuning  ,  tliis 
docs  not  aiguc  that  intensity-tuning  observed  in  AI  is  not  functionally  relevaiiL 
Establishing  the  significance  of  intensity-tuning  is  a  behavioral  and  functional  issue,  and 
not  only  an  issue  of  mechanism.  In  fact,  one  would  expect  the  auditory  system  to  exploit 
spectral  splatter  for  intensity  aiiidysis  since  tliis  is  a  natural  property  of  sound  stimuli. 

Functional  Significance  of  Intensity  Tuning 

Although  many  studies  have  been  devoted  to  determining  the  functional  role  of  the 
auditory  cortical  fields,  a  resolution  of  distinct  functions  has  not  been  achieved.  There  is 
strong  evidence  tliat  the  AI  is  involved  in  sound  iocalixation  (e.g.,  Heffner  and  Masterton 
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1975;  Thompson  and  Cortez  1983;  Jenkins  and  Merzenich  1984).  There  is  also  data 
which  supports  the  hypothesis  that  AI  is  involved  in  performing  vocalization  analysis 
(Gazzaniga  ct  al.  1973;  Coslctt  et  al.  1984;  Heffner  and  Heffner  1986a,b,  1989a, b),  In 
bats,  there  is  e\'idence  that  auditory  cortex  performs  temporal -spectral  analysis  (Suga 
1989;  Simmons  et  al.  1990). 

in  addition  to  sound  locaiizati,on,  temporal-spectral,  and  vocalization  analysis, 
there  is  a  large  body  of  data  indicating  that  auditory  cortex  is  involved  in  analyzing  the 
intensity  of  time-varying  frequency  components  comprising  auditoiy  scenes  (e.g.. 

Simmons  et  al.  1990;  Dear  ct  al.  1993).  The  presence  of  intensity  maps  in  AI  provides 
strong  circumstantial  evidence  tliat  AI  is  contributing  to  intensity  analysis  (e.g.,  Suga  and 
Manabe  1982).  While  long-duration  purc-tone,  supra-threshold  intensity  discrimination 
seems  to  be  intact  in  anim  us  with  auditory  cortical  lesions  (Swisher  1967,  Neff  ct  al. 

1975),  intensity  clis'  iunination  appears  to  be  impaired  for  short-duraticn  tones  (Cranford 
ct  al.  1982).  Additionally  data  from  complex  signal  discrimination  and  detection 
experiments  (Heilman  ct  al.  1973;  Olsen  et  al.  1975;  Blacttner  ct  al.  1989;  Simmons  ct  al 
1990),  lesion  studies  (Perrier  1876, 1889;  Munjyama  and  Kanno  1961;  Heffner  and 
Heffner  1986a,  1989a,  1990;  Jerger  et  al,  1969,  Auerbach  et  al.  1982),  and  sound- 
localization  studies  (Imig  et  al,  1991,  Rajan  ct  al  1990a;)  provide  evidence  tlrat  the 
auditory  cortex  is  involved  in  low-intensity  signal  detection,  amplitude-spectrum  analysis, 
signal/noisc  analysis,  and  amplitude  modulation  analy.sis. 

Auditoiy  cortex  has  been  shown  to  be  involved  in  analyzing  many  attributes  of 
signals.  Tlie  physiological  organization  of  auditory  cortex  is  well  suited  for  such  multi¬ 
dimensional  analysis.  In  AI  of  tlie  Musiached-bai,  cells  arc  tuned  to  Dopplcr-shifted- 
frequency  (Suga  and  Jen  1976),  amplitude-spectrum  (Suga  1977),  echo  delays  (Fitzpatrick 
1993),  and  to  specific  intraspecies  communication  vocalizations  (Kanwal  el  al.  1994; 
Ohlcmiller  ct  al.  1994).  Like  the  bat,  AI  of  the  cal  has  representations  of  multiple  signal 
parameters  (Schreiner  et  al  1988).  The  presence  of  multiple  auditory  cortical  fields  and 
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sub-divisions  of  these  fields  argues  for  multiple  functions  in  auditoi^  cortex.  Therefore  it  is 
reasonable  to  suppose  that  AI  can  be  involved  in  processing  many  different  signal 
parameters,  Ultimately  the  only  way  to  address  the  functional  significance  of  the  intensity 
maps  reported  in  this  paper  directly  is  to  perform  combined  behavioral  and  physiological 
studies  of  the  role  of  AI  in  intensity  analysis. 

Implications  for  Sound  Localization 

Recently  there  has  been  a  resurgence  in  interest  in  sound  localization  sensitivity 
of  AI  neurons  (Rajan  ct  al.  1990  a,b;  Ahissar  et  al.  1992).  Non-monotonic  neurons  have 
been  shown  to  be  predominantly  directionally  selective  (Imig  et  al.  1990).  The  spatial 
disnibution  of  minimum  tliresholds  (Fig.  9)  indicates  that  the  ventral  multiple  unit  non¬ 
monotonic  region  with  its  restricted  intensity  response  and  threshold  range  cannot  encode 
intensity-independent  location.  While  the  cells  in  this  topographical  region  would  likely 
be  directionally  selective,  the  paucity  of  monotonic  cells  tliat  could  fire  at  high  intensities 
indicate  that  tliis  region  can  only  encode  location  al  Jov.'  intensities.  This,  (;oupled  with 
the  fact  that  monolonic  cells  are  less  likely  to  be  dircctionaiiy  selective,  make  the  vei.xtral 
non-monotonic  region  a  poor  candidate  for  Icvcl-tolci  ant  spatial  encoding.  The 
monotonic  multiple  unit  regions  would  be  better  candidates  to  serve  such  a  function, 
since  there  is  a  full  representation  of  intensities  (including  many  monaurally  non¬ 
monotonic  single  neurons  and  possibly  more  free- field  nou-monotonic  neurons).  The 
recent  data,  however,  indicate  tliat  a  simple  intensity-tolerant  map  of  auditory  space  is 
unlikely  to  be  found  witliin  cat  AI. 
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Possible  Purpose  of  Non-monoionic  Regions:  Signal  Detection? 

If  the  non-mouotonic  multiple  unit  regions  in  AI  are  not  encoding  intensity- 
independent  location  information,  what  are  they  encoding?  One  possibility  is  that  the 
strongly  non-monotonic  neurons  arc  encoding  spectral  information  just  above  the 
background  noise.  The  ventral  non-monotonic  region,  which  closely  overlaps 
could  ser\'s  this  purpose  well. 

The  hypothesis  of  an  area  specialized  for  signal  analysis  just  above  background 
noise  is  supported  by  (1)  the  low  thresholds  of  these  neurons  (Fig  9),  (2)  the  low  scatter 
in  frequency  and  threshold  values,  (3)  tlie  sharp  frequency  tuning,  (4)  the  permanent 
increases  in  intensity  thresholds  following  AI  lesions  (Muruyama  and  Kanno  1961; 
Heffner  and.  Heffner  1986a,  1990),  and  (5)  the  effect  of  an  increase  of  background  noise 
on  the  responses  of  non-monotonic  neurons  (Phillips  1985,1990;  Phillips  and  Cynader 
1985).  Neurons  in  the  ventral  non-monotonic  region  consistently  had  the  lowest 
thresholds  within  the  isofrequcncy  domain  and  have  the  smallest  range  of  threshold 
values  (Fig  9).  This  implies  that  the  intensity  sensitivity  is  resLicted  to  a  small  range  of 
low  levels.  When  tones  are  presented  in  the  presence  of  continuou.s  (ongoing) 
background  noise,  the  spike  count  versus  intcn.sity  function  of  non -monotonic  neurons 
shifts  as  a  linear  function  of  the  intensity  of  the  noise  masker  (Phillips  1985,1990; 

Phillips  and  Cynader  1985). 

Along  with  the  prediction  that  neurons  in  the  vcnu-al  non-monotonic  regions  are 
able  to  shift  tlicir  spike  count  versus  level  functions  relative  to  background  noise,  these 
neurons  respond  to  a  narrow  range  of  frequencies,  and  respond  weakly,  if  at  all,  to  broad¬ 
band  stimuli  (Phillips  et  al.  1985;  Schreiner  et  al.  1988;  Schreiner  and  Mendclson  1990). 
Accordingly  the  spectral  analysis  of  signals  close  to  the  background  signal  benefits  from 
a  narrow  band  analysis.  Limiting  bandwidth  enables  the  neurons  to  use  coherent 
modulations  to  detect  signals  in  noise.  The  ventral  non-monotonic  region  approximately 
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lines  up  with  BV/jOmin.  Ae  lowest  threshold  region.  Topographically, 

the  region  near  the  "physiological  center"  of  AI  is  highly  selective  for  detec  ting  the 
frequency  and  amplitude  of  narrow-band  sdaiuli  just  above  the  background  noise,  and 
might  be  interpreted  as  a  sub- region  specialized  for  signal  detection.  The  narrow- band 
frequency  response  of  this  area  and  the  ."nai:al  selectivity  of  non-monotonic  neurons 
(Imig  et  al.  1990)  combined  with  their  low  thresholds  indica'vC  that  the  two  non¬ 
monotonic  regions  may  play  an  important  role  iu  spectral-spatial  processing  for  stimuli 
whose  intensities  arc  within  about  40  dB  of  die  detection  threshold  of  background  noise. 
Note  that  natural  stimuli  to  which  tliis  region  would  responu  must  contain  at  least  one 
narrow-band  because  most  non-monotouic  units  do  not  respond  well  to  broad-band 
stimuli  (Phillips  et  al,  1985,  Schreiner  and  Mendeison  1990). 

Parceling  ofAI 

The  results  of  this  study  raise  some  questions  about  the  division  of  AI.  Recent 
papers  (Sutter  and  Schreiner  1991;  Schreiner  and  Sutter  1992)  suggested  that  there  are  at 
least  two  physiologically  distinct  regions  in  AI.  In  dorsal  AI  (Aid)  there  is  a  single  unit 
map  for  stimulus  bandwidth.  Broadly  tuned  neurons  are  located  dorsally  and  there  is  a 
gradual  progression  of  narrower  tuning  toward  the  dorsoventral  center  of  AI.  In  ventral 
AI  (AIv),  most  single  units  are  sharply  tuned  for  frequency.  The  amplitude  tuning 
described  in  this  paper  provides  further  support  for  the  physiological  sub-division  of  AI 
into  two  parts.  In  ventral  AI,  where  there  is  no  topographical  gradient  for  the  bandv/idth 
of  single  units,  there  is  a  single  unit  map  for  stimulus  intensity.  However  in  dorsal  AI, 
where  there  is  a  single  unit  map  for  bandwidth,  there  is  no  single  unit  map  for  stimulus 
intensity. 

While  subdividing  AI  into  at  least  two  physiological  sub-divisions  is  supported  by 
the  data  from  this  and  other  studies,  the  dividing  line  between  AIv  and  Aid  remains 
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uncertain.  The  implied  dividing  line  from  the  previous  study  was  the  BW  iQMOmin  (Q* 
max  in  Sutter  aiid  Schreiner  1991)  as  derived  from  multiple  unit  mapping  of  BW4()£j3 
and  E WjodB  •  In  this  study,  BW^oMOmin  was  not  as  strong  a  topographical  pooling 
landmark  as  \  ;as  BW40dB  or  the  monotonicity  map.  The  topographical  analysis  for 
monotonicity  was  critically  dependent  on  using  high  intensity  properties  of  neurons. 

With  this  in  mind,  the  question  must  be  raised  whether  there  is  a  clear  dividing 
line  between  dorsal  and  ventral  AI.  The  answer  appears  to  be  tliat  there  is  no  sharp 
boundary  or  unambiguous  line  that  can  be  drawn  to  determine  functionally  distinct  Al 
regions.  The  AIv,  Aid  classification  from  Schreiner  and  Sutter  (1992)  was  adequate  for 
studying  bandwidth  properties.  For  studying  the  spatial  distribution  of  a  given  filter 
property,  the  multiple  unit  topography  or  that  property  serves  as  the  best  topographical 
pooling  landmark.  The  EW40dB  Probably  is  the  most  useful  predictor  of  the 
location  of  otlier  response  properties  studied  up  to  this  time.  A  low-threshold  region,  with 
neurons  sharply  tuned  for  frequency  ana  amplitude,  comprises  about  plus  or  minus  1 
millimeter  from  BW40niin'  Ventral  to  BW40niin,  ahnost  all  single  neurons  are  sharply 
tuned  for  frequency,  and  there  is  a  single  unit  map  of  stimulus  intensify.  Dorsal  to 
BW40niin’  there  is  no  clear  single  unit  map  of  intensity,  but  there  is  a  gradient  for 
bandwidth.  The  sharpness  c  f  frequency  tuning  in  Aid  closely  follows  the  multiple  unit 
map,  while  amplitude  tuning  does  not.  The  construction  of  the  dorsal  non-monotonic 
region,  which  is  present  in  multiple  unit  monotonicity  map,  but  not  in  pooled  .single 
neuron  maps,  can  be  accounted  for  by  a  combination  of  threshold  scatter  and  the 
differential  spike  contribution  of  single  neurons. 
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Figure  1: 

Cartoon  of  multiple  unit  shaqtness  of  amplitude  (A)  and  frequency  (B)  tuning 
maps  from  previous  studies  (Schreiner  and  Mendelson  1990,  Schreiner  et  al.  1992). 
Monotonicity  map  (A)  has  two  non-monotonic  (dark)  regions  sandwiched  between 
monotonic  (light)  regions.  Tlie  sharpness  of  frequency  tuning  (Q40dB)  map  has  a  sharply 
tuned  region  (dark)  which  gradually  gives  way  to  broader  frequency  tuning  (light).  Tlie 
ventral  non-monotonic  region  roughly  lines  up  with  the  sharply  tuned  region  (dashed 
line).  Both  maps  gradients  are  oriented  roughly  parallel  to  tire  isofrequency  domain.  SSS: 
suprasylvian  sulcus;  PES:  posterior  cctosylvian  sulcus;  AES:  anterior  cctosylvian  sulcus. 

Figure  2: 

Onc-prcscntatioii  FRA  (A)  and  coircsponding  spike  count  vs.  level  function  (B) 
for  a  non-monotonic  neuron.  Gap  in  dotted  background  in  (A)  represents  1/4  ocuive  band 
over  which  spikes  were  counted.  Two  spikc.s  on  the  low  frequency  side  at  55  dB  were 
outside  of  the  1/4  octave  band  and  thus  were  not  counted  for  the  spike  count  vs.  level 
plot.  The  maximum  of  0.8  spikes  per  repetition  seems  low  because  tire  cell  was  more 
aaiTowly  tuned  than  the  1/4  analysis  bin  used  (.see  Fig.  12  for  more  discussion  of  tliis 
mctliodological  effect).  The  monotonicity  ratio  is  calculated  by  dividing  the  response  at 
tlie  highest  tested  intensity,  72  dB,  by  die  largest  response.  The  spike  count  vs.  level  plot 
incorporates  data  from  another  1 -repetition  FRA  (not  shown)  from  dre  same  cell  which 
covered  from  2  to  72  dB  SPL.  Also  nodcc  that  die  frequency  range  in  this  example  is 
high  rcsoludon  (90  frequencies). 

Figure  3: 

Spike  count  vs.  level  functions  for  six  neurons.  All  spikes  from  an  FRA  arc 
counted  over  15  dB  (3  levels,  shown  by  horizontal  lines  connecting  the  3  data  collects  for 
each  point)  and  1/4  octave  (5  frequencies).  Some  are,  based  on  more  dian  1  repetition  of 
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the  FRA  procedure,  and/or  were  excited  by  less  than  a  1/4  octave  frequency  range.  Tite 
monotonicity  ratio  is  calculated  by  dividing  tlic  response  at  the  highest  tested  intensity  by 
the  largest  response. 

Figure  4: 

Spatial  distribution  of  monotonicity  ratio  witliin  isofrequency  domain  (7-9  kHz) 
for  cat  SUTC12.  In  (A)  the  multiple  unit  data  points  are  shown.  The  result  of  a 
weighting  average  spatial  smoothing  algorithm  on  the  data  points  is  shown  in  (B). 
Clusters  with  the.  same  dorsovcntral  coordinate  were  assigned  weights  of  1 .0.  Recording 
sites  witliin  0.10  millimeters  of  each  other  were  assigned  v/cights  of  0.75.  Multiple  units 
between  0.1 1  and  0.25  millimeters  were  assigned  weights  of  0,5,  and  neighbors  between 
0.26  and  0.50  lu.Ulimctcrs  were  assigned  weights  of  0.2.5.  The  weighted  average  was 
calculated  to  give  a  "smootlied"  value  for  each  datit  point.  The  two  minima  in  the 
function  arc  identified  by  dotuxl  lines.  The  distance  between  the  minima  then  was 
normalized  to  "3  millimeters"  (C)  which  in  this  case  expanded  the  real  distances  by  7%. 
Notice  that  in  (B)  tlie  map  i.s  less  than  5  millimeters  but  in  (C)  it  i,s  more  than  5.  For  all 
graphs  the  ventral  non-inonotonic  region  is  assigned  tlie  value  "3"  and  the  dorsal  non¬ 
monotonic  region  is  assigned  the  value  "0". 

Figure  5: 

Ulusirative  example  of  pooling  method  using  the  two  cases  shown  in  Fig  4.  The 
multiple  unit  maps  (connected  filled  circles)  witli  each  single  (crosses  for  cat  SUTC16 
and  open  diamonds  for  case  SUTC12)  and  multiple  unit  (open  circles)  arc  shown  for  two 
cases  (A  and  C).  Single  unit  points  (same  symbols  as  in  A  and  C)  aie  displayed  after 
normalization  of  locations  to  "3  normalized  millimeters"  from  the  multiple  unit  map  (B 
and  D).  Single  units  from  tlie  2  cats  are  superimposed  with  their  common  normalized 


Sutter  and  Schreiner:  Single  Unit  Intensity  Topography 


J659-3 


coordinate  system  (E).  Notice  that  the  point  in  All  (C  and  D)  is  not  included  in  final 
pooled  data 

Figure  6: 

Scattergram  of  pooled  monotonicity  ratio  data  for  (A)  multiple  units,  and  (B) 
single  units.  For  (A)  the  dashed  line  connecting  filled  rectangles  displays  the  mean 
monotonicity  ratios  of  multiple  units  averaged  over  0.5  or  1.0  millimeter  bins;  rectangles 
arc  located  at  the  center  of  the  bins.  One  half  millimeter  bins  were  used  if  at  least  ten 
neurons  were  located  within  the  bin,  otherwise  1.0  millimeter  bins  were  used.  The 
numbers  of  recordings  in  each  bin  are  displayed  at  the  top  of  each  graph.  A  similar  curve 
for  single  neurons  is  shown  in  Fig.  6B.  Mean  binned  results  lor  single  and  multiple  units 
with  magnified  monotonicity  ratio  scale  (C).  For  all  plots  vertical  dashes  lines  represent 
the  normalization  points  used,  i.e.,  center  of  the  dorsal  and  ventral  non-monotonic 
regions  as  determined  by  multiple  unit  mapping. 

Figure  7: 

Percentage  of  strongly  (black  filled)  and  intermediately  (hatched)  non-monotonic 
neurons  for  topographically  pooled  multiple  (A)  and  single  (B)  units.  Number  of  units 
pooled  for  each  bin  is  shown  by  n  value  on  top  of  each  plot. 

Figure  8; 

Ratio  of  spikes  conuibuted  from  non-monotonic  units  and  monotonic  units. 
Maximum  spike  count  per  stimulus  to  tones  (FRmax)  were  calculated  for  each  neuron. 
Within  each  bin,  FRmax  was  added  for  all  non-monotonic  neurons,  and  this  sum  was 
then  divided  by  the  equivalent  measure  for  monotonic  neurons.  Notice  that  a  maximum 
of  the  ratio  occurs  both  in  the  dorsal  and  ventral  non-monotonic  regions. 


Sutter  and  Schreiner:  Single  Unit  Intensity  Topography 


J659-3 


Figure  9: 

Threshold  data  pooled  across  animals.  Zero  and  three  millimeters  correspond  to 
two  non-monotonic  regions.  The  line  drawn  at  3  millimeters  for  all  other  figures  has 
been  omitted  to  allow  for  better  viewing  of  the  data.  Threshold  was  referenced  such  that 
for  each  animal  0  dB  corresponded  to  the  lowest  multiple  unit  threshold  encountered. 
Notice  the  minima  in  threshold  scatter  at  the  two  non-monotonic  regions  and  the 
threshold  minimum  at  the  ventral  non-monotonic  region.  Unclassifiablc  Monotonicity 
(open  boxes  in  A)  refers  to  cells  in  which  monotonicity  ratio  could  not  be  calculated 
because  less  than  50  dB  of  recording  range  was  achieved. 
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Figure  10: 

Mean  binned  monotonicity  ratio  pooled  relative  to  BW40min  and  BWlO/40  min 
as  described  in  Schreiner  and  Sutter  1992. 

Figure  1 1 : 

Percentage  of  strongly  (black)  and  intemtediatcly  (stippled)  non-monotonic  single 
(A,C)  and  multiple  (B,D)  units.  Topographical  pooling  performed  relative  to  BVMOmin 
and  BW10/40min. 

Figure  12: 


Habituation  effects  of  the  PSTH  method  for  neuron  whose  FRA  is  depicted  in 
Fig.  2.  Comparison  of  spike  count  vs.  level  plot  derived  from  50  repetition  CF  tone 
PSTHs  with  measures  derived  from  FRA’s  (A).  Notice  that  the  0.25  octave  window  used 
underestimates  spikes  per  presentation.  This  is  because  tiie  neurons  FTC  is  less  than  1/4 
octave  wide  (B W  is  approximately  0. 1 5  octaves).  The  50  repetition  PSTH  also 
underestimates  tlie  number  of  spikes  per  presentation  as  compared  to  the  random  order 
FRA  mctliod.  Tliis  is  probably  due  to  habituation.  When  only  the  first  5  repetitions  of 
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each  PSTH  are  counted  (B),  the  habituation  effect  disappears.  The  0.1  octavo  FRA 
method  and  the  PSTH  method  now  yield  comparable  results.  Notice,  though,  that  the 
descending  branch  (intensities  greater  than  20  dB)  is  less  sharply  sloped  for  the  FRA 
method  as  compared  to  PSTHs.  This  is  because  at  high  intensities,  to  which  the  CF 
response  is  substantially  reduced,  off-CF  responses  are  still  present  (see  Fig.  2). 
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Abstract 

We  have  previously  reported  the  results  of  behavioral  training  of 
three  cats  which  performed  an  auditory  discrimination  task  involving 
complex  stimuli  that  were  created  to  simulate  vowels  -  their  frequency 
spectrum  containing  various  peaks  (vowel  formants)  located  at  various 
spacings.  Their  performance  improved  gradually  and  steadily  to  a  final 
best  difference  threshold  (Keeling  et  al,  1994).  In  this  paper,  wc  describe 
how  the  behavioral  thresholds  correlated  with  the  electrophysiologically- 
measured  Q-40  values  of  overall  neural  responses,  indicating  that  a 
sharpening  in  tuning  was  related  to  the  degree  of  proficiency  in  the 
auditory  discrimination.  We  also  report  clectrophysiologica!  results  from 
three  cats  trained  with  a  slightly  different  stimulus,  from  whom  behavioral 
thresholds  could  not  be  obtained,  but  whose  cortical  representation  was 
changed  nonetheless.  The  data  imply  that  language  acquisition  involves 
dynamic  cortical  reorganization  of  speech  feature  representations  and  that 
human  linguistic  capabilities  evolved  from  basic  mammalian  auditory 
processing  capacities. 
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Speech  perception  is  one  of  the  most  elaborate  feats  performed  by 
the  human  brain,  and  while  a  number  of  models  of  speech  perception  have 
been  posited  and  electrophysiological  recording  of  cortical  neuronal 
response  to  various  acoustic  signals  has  been  performed,  only  speculative 
theories  about  the  general  physiological  processes  underlying  speech 
perception  have  so  far  been  possible.  Consider  the  processing  of  vowels. 
Vowel  formants 

Vowels  are  characterized  by  their  formants  -  frequencies  which  are 
elevated  in  amplitude  relative  to  other  frequencies  and  which  are  produced 
by  the  natural  resonances  of  the  vocal  tract  activated  during  production  of 
the  vowel  (indicated  as  FI,  F2,  F3,  etc.).  Formant  values  show 
considerable  variability  across  speakers,  however,  and  even  overlap  for 
some  vowels  (Peterson  &  Barney,  1952).  The  variability  across  speakers  is 
due  to  physical  differences  in  vocal  tract  size  but  also  to  a  speaker's  past 
experience,  mainly  his/her  particular  dialectical  background,  but  the  values 
may  vary  within  context  and  even  across  repetitions  as  well. 

In  light  of  this  variability  it  is  amazing  that  listeners  are  so  proficient 
at  identifying  vowels.  The  time  course  of  a  dialogue  between  two  speakers 
is  usually  quite  rapid  and  only  rarely  marked  by  mislabeling  and 
misunderstanding  of  words.  If  there  is  no  constant  code  between  absolute 
formant  frequency  values  and  a  psychological  linguistic  percept,  how  are 
vowels  perceived? 

Miller  (1989)  has  proposed  what  he  calls  an  "auditory-perceptual 
theory  of  phonetic  recognition".  The  theory  is  descended  from  the 
fonmant-ratio  theory  of  vowel  quality,  originally  proposed  by  Lloyd 
(1890),  who  stated  that  vowel  quality  depends  on  the  intervals  between  the 
formants,  not  their  absolute  values.  The  theory  relies  heavily  on  the 


logarithmic  frequency  scale  and  Miller  gives  a  detailed  description  of  the 
effectiveness  of  various  scales  (e.g.  mel.  Bark,  Koenig)  in  clustering  vowel 
data  (i.e,  plotting  formant  values  against  each  other  as  Peterson  &  Barney 
(1952)  had  done).  The  best  grouping  was  observed  by  using  the  logs  of 
ratios  of  formant  center  frequencies  measured  either  in  hertz  or  mels  (the 
unit  used  for  subjective  pitch).  Miller  further  Justifies  the  use  of  a  log 
frequency  scale  by  citing  Weber's  law  (namely  that  as  stimuli  are  increased 
by  multiplication,  sensations  increase  by  addition)  and  the  octave  basis  of 
mus’cal  scales.  The  theory  makes  use  of  formant  ratios  and  a  "sensory 
reference"  which  is  based  on  the  speaker's  average  fundamental  frequency. 
Sensory  and  perceptual  "paths"  describe  changes  in  spectral  patterns  of  the 
formants  during  the  course  of  an  utterance.  "It  is  proposed  that  a 
segmentation  mechanism  based  on  the  dynamic?;  of  these  spectral  patterns 
causes  perceptual  target  zones  to  issue  neural  symbols  or  category  codes 
that  correspond  to  the  vowel  sounds"  (p.2129). 

Chistovich  (1985)  has  reported  a  difference  beriveen  stimuli  with 
closely  spaced  formants  and  those  with  widely  spaced  formants  when 
amplitude  relations  between  the  two  formants  are  changed.  Closely  spaced 
formants  were  considered  to  be  those  with  a  distance  between  them  of  less 
than  3  -  3.5  bark  (roughly  equivalent  to  one  octave).  Their  perception  was 
closely  related  to  the  one-formant  stimulus  phonetically  most  similar  to  the 
two-formant  stimulus,  implying  a  loss  of  ability  to  discriminate  the  two 
peaks  within  this  critical  distance.  She  suggested  therefore  that  spatial 
integration  of  spectral  information  may  be  an  important  factor  in  the 
processing  of  vowels.  Chistovich  speculates  that  "there  exists  a  set  of 
detectors  tuned  to  specific  spectral  configurations.. .nearly  simultaneous 
occurrence  of  two  peaks  in  the  dynamic  spectrum  is  needed  to  excite  the 
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detectors  tuned  to  two-formant  shapes"  (p.803).  She  laments  the  absence  of 
neurophysiological  data  concerning  the  response  of  neurons  in  AI  of 
auditory  cortex  to  stimuli  with  complex  spectrum  shapes. 

The  ratios  of  the  formants  may  be  an  important  cue.  We  calculated 
the  ratios  of  the  first  and  second,  and  second  and  third  formants;  since 
frequency  is  represented  on  a  logarithmic  scale  along  the  basilar  membrane 
within  the  cochlea,  we  computed  the  ratios  of  the  formants  in  units  per 
octave  (Miller,  1989)  and  hypothesized  that  the  perception  of  variations  in 
the  position  of  spectral  peaks  is  dependent  on  the  spacing  of  the  peaks. 

Auditory  Cortex 

The  auditory  cortex  in  the  human  and  in  many  nonhuman  primates  is 
located  primarily  on  the  superior  temporal  gyrus  and  is  buried  for  the 
most  part  within  the  Sylvian  fissure  (Lass,  1988).  In  the  cat,  the  auditory 
cortex  represents  a  large  area  mostly  on  the  lateral  surface  of  the  brain. 

The  following  descriptions  will  refer  to  the  cat. 

On  the  basis  of  constant  cellular  characteristics  seen  with  Nissl  stain. 
Rose  (1949)  defined  primary  auditory  cortex  (termed  AI),  secondary 
cortex  (AH),  and  an  auditory  area  on  the  posterior  ectosylvian  gyrus  (Ep). 
Primary  auditory  cortex  was  described  as  cytoarchitecturally  similar  to 
other  primary  sensory  cortex,  with  six  layers  and  a  high  density  of 
pyramidal  and  granule  cells  in  layers  II,  III  and  IV.  The  ventral  division 
of  the  medial  geniculate  body  projects  to  primary  auditory  cortex,  the 
afferent  fibers  ending  mainly  in  layer  IV  (Winer,  Diamond,  & 

Raczkowski,  1977). 

Using  surface  evoked  potential  recordings,  Woolsey  and  Walzl 
(1942)  first  reported  a  tonotopic  organization  in  auditory  cortex.  In  1975, 
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Merzenich,  Knight,  &  Roth  refined  this  description  employing 
microelectrode  mapping  techniques.  They  reported  that  "any  given 
frequency  band  (or  sector  of  the  cochlear  partition)  is  represented  across  a 
belt  of  cortex  of  nearly  constant  width  that  runs  on  a  nearly  straight  axis 
across  AI  “(p.247)  ~  see  figure  1.  Also,  best  frequency  was  constant  within 
vertical  penetrations  into  AI  in  the  active  middle  and  deep  cortical  layers, 
i.e.  the  cells  appear  to  be  organized  in  columns.  Further,  "there  is  an 
orderly  representation  of  the  cochlea  within  the  field  rostral  to  AI,  with  a 
reversal  in  best  frequencies  across  its  border  with  AI".  Finally, 
"physiological  definitions  of  AI  boundaries  are  consistent  with  their 
cytoarchi tectonic  definition"  (ibid). 

Further  mapping  studies  (Knight,  1977;  Merzenich,  Roth,  Knight  & 
Colwell,  1977;  and  Reale  &  Imig,  1977)  have  shown  that  there  arc  at  least 
three  other  cochleotopically  organized  cortical  fields  in  cats,  besides  AI, 
namely,  the  anterior  auditory  field  (AAF),  which  borders  AI  rostrally,  the 
posterior  auditory  field  (PAF),  and  the  ventro-posterior  field  (VPAF)  -  see 
figure  2,  from  Merzenich,  Andersen,  &  Middlebrooks  (1979). 
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Figure  1.  An  example  of  a  eortical  map  showing  best  frequency 
as  a  function  of  distance  across  the  cortical  surface  (from 
Merzenich  et  al.,  1975). 


AUDITORY  CORTICAL  FIELDS 


Figure  2.  Schematic  representation  of  the  basic  organization  of 
auditory  cortical  fields  in  the  cat.  A1  -  primary  auditory  Held; 
AAF  >  a^*^terior  auditory  field;  PAF  •  posterior  auditory  field; 
VPAF  -  ventroposterior  auditory  field.  The  region  of 
representation  of  the  cochlear  apex  (a)  and  cochlear  base  (b)  are 
indicated  for  all  four  of  these  cochleotopically  organized  fields, 
(from  Merzenich  et  al.,  1979). 
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Another  organizing  feature  of  the  auditory  cortex  is  the  existence  of 
binaural  bands  roughly  orthogonal  to  the  iso-frequency  contours  (Imig  & 
Adrian,  1978;  Imig  &  Brugge,  1978;  Middlebrooks,  Dykes,  &  Merzenich. 
1980).  In  alternating  bands,  "neurons  are  predominantly  'excitatory- 
inhibitory'  (driven  by  contralateral  stimulation  with  driven  responses 
inhibited  by  ipsilateral  stimulation);  or,  in  the  second  set  of  bands, 
'excitatory-excitatory'  (driven  by  both  contralateral  and  ipsilateral 
stimulation;  or  driven  by  one  or  the  other  ear,  with  the  response  strongly 
facilitated  by  stimulation  of  the  other)"  (Merzenich  et  al.,  1979,  p.65).  It 
has  also  been  shown  that  discrete  frequency-band-specific  lesions  in 
unilateral  primary  auditory  cortex  produced  profound  deficits  of  sound 
localization  ability  in  the  contralateral  hemifleld  (Jenkins  &  Merzenich, 
1984). 

There  are  however,  some  features  of  signals  other  than  frequency 
and  laterality'  which  appear  to  be  represented  in  a  systematic  way  within 
auditory  cortex.  For  example,  sharpness  of  tuning  is  distributed  along  the 
iso-frequency  domain  of  AI.  Schreiner  &  Cynader  (1984)  have 
demonstrated  low  Q-10  dB  values  in  area  All.  Moving  dorsally,  the  Q-10 
dB  values  increased  gradually,  reaching  a  maximum  about  2  mm  dorsal  to 
the  AI/AII  border.  From  the  central  portion  of  AI  toward  the  Supra- 
Sylvian  sulcus,  the  Q-10  dB  values  gradually  decreased,  although  they 
remained  higher  there  than  in  AH.  This  gradient  in  sha.'pness  of 
along  the  dorso-ventral  extent  of  AI  was  confimied  by  Schreiner  & 
Mendelson  (1990),  the  most  dorsal  and  most  ventral  regions  of  AI  being 
the  most  broadly  tuned,  with  sharpest  tuning  in  the  central  area. 

Amplitude  modulated  signals  also  appear  to  be  represented  somewhat 
systematically  within  the  anterior  auditory  cortical  area  (Schreiner  & 
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Urbas,  1986,  1988).  Ai'.o,  non-monotonicity  of  rate/level  functions,  i.c. 
the  degree  of  change  of  firing  rate  as  a  function  of  level  increase,  has  been 
investigated,  and  it  appears  that  primary  auditory  cortex  contains  a 
topographic  representation  of  intensity  information  in  the  iso-frequency 
domain  (Schreiner,  Mendelson,  &  Sutter,  1992;  Heil  &  Scheich,  1991). 
Finally  the  direction  and  velocity  of  logarithmic  frequency  sweeps  are 
systematically  represented  in  AI  (Mendelson,  Schreiner,  Grasse,  &  Sutter, 
1988;  Mendelson  &  Grasse,  1992).  The  most  dorsal  region  of  AI  showed 
a  preference  for  sweeps  from  low  to  high  frequencies;  moving  ventrally, 
the  preferred  sweep  direction  gradually  changed  to  sweeps  from  high  to 
low  frequencies,  while  most  of  the  ventral  portion  of  AI  showed  no 
specific  sweep  direction  selectivity.  FM-speed  selective  responses  also 
displayed  a  systematic  spatial  organization,  again  progressing  as  a  function 
of  the  dorso-ventral  position  within  AI. 

Suga  (1984)  has  eloquently  demonstrated  the  systematic 
representation  of  different  types  of  biosonar  information  in  the  auditory 
cortex  of  the  mustached  bat.  He  shows  how  "complex  acoustic  signals  are 
processed  by  specialized  neurons  that  are  tuned  to  particular  information- 
bearing  parameters  (IBPs)  or  combinations  of  IBPs"  (p.315). 

Furthermore,  "the  biologically  more  important  values  of  an  IBP  are 
overrepresented  by  the  large  number  of  IBP  filters  tuned  to  them  in  order 
to  achieve  higher  resolution"  (ibid).  Thus  the  highly  refined  localization 
ability  of  this  animal  has  a  corresponding  highly  refined  cortical 
topographical  representation.  He  has  hypothesized  (1988)  that  speech 
recognition  is  similarly  based  upon  a  spatio-temporal  pattern  of  neural 
activity  occurring  in  various  cortical  areas. 
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Wollberg  &  Newman  (1972;  Newman  &  WoIIberg,  1973)  looked  at 
the  responses  of  cells  in  the  superior  temporal  gyrus  (STG)  of  awake 
squirrel  monkeys  to  a  variety  of  different  species-specific  vocalizations. 
Eighty-nine  percent  of  the  cells  sampled  responded  to  over  half  of  the 
vocalizations,  suggesting  that  "many  neurons  in  the  STG  do  not  select 
between  different  classes  of  vocalizations  according  to  presence  or  absence 
of  simple  acoustic  features"  (Newman  &  Wollberg,  1973,  p.287),  and  that 
vocalizations  are  encoded  in  a  highly  complex  way. 

Langner,  Bonke,  &  Scheich  (1981)  explored  neuronal  discrimination 
of  natural  and  synthetic  vowels  in  field  L  of  trained  mynah  birds.  Field  L 
is  a  layered  and  tonotopically  organized  primary  auditory  projection  area 
in  the  bird.  Only  a  minority  of  units  responded  selectively  to  one 
particular  vowel;  many  units  responded  to  several  vowels.  It  was  therefore 
speculated  that  vowel  recognition  may  be  based  on  populations  of 
simultaneously  activated  units. 

Steinschneider  et  al.  (1990)  recorded  activity  in  AI  in  an  awake 
monkey  to  3  consonant-vowel  syllables,  and  to  the  syllables'  isolated 
formants  and  formant  pairs.  They  reported  that  response  features  were 
related  to  the  tonotopic  organization  of  AI  and  that  formant  interactions 
seem  to  modulate  the  response  to  whole  syllables. 

Shamnia,  Fleshman,  Wiser,  and  Versnel  (1993)  studied  responses  in 
primary  auditory  cortex  of  the  ferret  to  direction  of  frequency-modulated 
tone  sweeps  and  to  spectrally  shaped  noise.  The  excitatory  and  inhibitory 
portions  of  the  response  area  were  described  in  terms  of  the  asymmetry  of 
the  excitation  and  inhibition  around  the  best  frequency  and  were  shown  to 
correlate  with  the  preferred  direction  of  FM  sweeps  and  with  spectral 
shapes.  The  authors  concluded  that  "cortical  responses  encode  the  locally 
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averaged  gradient  of  the  acoustic  spectrum  by  their  differential  distribution 
along  the  isofrequency  planes.  This  enhances  the  representation  of  such 
features  as  the  symmetry  of  spectral  peaks  and  edges  and  the  spectral 
envelope”  (p.367).  They  suggest  a  possible  analogy  in  AI  with  spatial 
frequency  "channels”  in  the  primar3'  visual  cortex. 

Schreiner,  Calhoun,  and  Keeling  (1993)  presented  ripple  stimuli  to 
cortical  neurons  in  the  cat.  The  ripple  spectra  that  were  generated  had  the 
following  characteristics:  the  carrier  consisted  of  a  harmonic  series  (FO 
ranging  from  50  to  200  Hz)  with  a  6  dB/octave  decline  of  the  component 
amplitudes  (120  to  255  components);  the  bandwidth  of  the  stimulus  was  3 
octaves;  the  spectral  envelope  of  the  signal  was  represented  by  a  sinusoid 
on  a  logarithmically  scaled  frequency  axis,  the  frequency  of  the  envelope 
sinusoid  was  referred  to  as  ripple  density  (ripples/octave);  the  modulation 
depth  of  the  envelope  (ripple  depth)  was  linear  on  a  dB  scale.  The 
gcornetiical  center  of  the  band-limited  signal  was  always  at  a  maximum  of 
the  sinusoidal  spectral  envelope.  The  center  was  positioned  at  the 
characteristic  frequency  (cO  of  each  cortical  neuron  and  the  ripple  density 
or  the  frequency  distance  between  spectral  peaks  was  systematically  varied. 
The  pjsulting  'ripple  transfer  function'  was  reconstructed  for  different 
modulation  depths  and  overall  intensities.  For  the  majority  of  neurons,  the 
ripple  transfer  function  was  a  bandpass  and  a  'best'  ripple  density  could  be 
defined.  The  remaining  transfer  functions  appeared  to  have  a  lowpass 
characteristic,  at  least  for  those  ripple  densities  used  (0.3  to  8 
ripples/octave).  Best  ripple  densities  ranged  from  0.6  to  4  ripples/octave 
with  a  mean  between  1  and  2  ripples/octave.  Spatial  mapping  of  responses 
to  ripple  spectra  along  the  isofrequency  domain  of  AI  revealed  a  systematic 
shift  of  the  best  ripple  density  in  multiple  unit  responses  from  central  AI  to 
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dorsal  AI.  The  shift  paralleled  the  previously  described  variation  of 
integrated  excitatory  bandwidth  with  sharply  tuned  locations  and  high  best 
ripple  densities  near  the  dorso-ventral  center  and  broader  tuning  curves 
with  lower  best  ripple  densities  toward  the  dorsal  end  of  AI.  Ventral  AI 
showed  a  less  systematic  distribution  of  ripple  densities.  The  hypothesis  of 
a  systematic  'spatial  frequency'  or  'spectral  envelope  frequency' 
representation  in  AI,  oriented  orthogonal  to  the  frequency  axis,  was 
supported. 

Thus,  one  secs  at  the  primary  auditory  cortical  level  a  rudimentary 
representation  of  spectral  and  temporal  features  of  speech  or  speech-like 
signals.  What  is  not  seen  is  selectivity  to  particular  calls,  or  syllables,  or 
vowels.  Rather  neural  responses  seem  to  be  evoked  by  discrete  stimulus 
components.  What  is  supported  is  the  idea  that  percepts  are  created  by 
patterns  of  neural  firing  across  the  length  and  breadth  and  probably  depth 
of  the  sensory  cortical  area. 

Plasticity  of  Auditory  Cortex 

Merzenich  and  colleagues  have  provided  evidence  that,  in  adult 
mammalian  somatosensory  cortex,  the  topographic  map  of  the  body  surface 
is  not  static,  and  that  receptive  fields  at  particular  cortical  sites  can  change 
in  size  and  location  throughout  adult  life;  the  cortical  locus  at  which  a 
given  skin  surface  is  represented  can  shift  several  hundreds  of  microns 
across  the  cortex  (Clark,  Allard,  Jenkins,  &  Merzenich,  1988;  Jenkins, 
Merzenich,  Ochs,  Allard,  &  Guic-Robles,  1990;  Allard,  Clark,  Jenkins,  & 
Merzenich,  1991).  They  have  suggested  that  "inputs  are  selected  on  the 
basis  of  temporal  coiTelation"  (Clark  et  al,  1988,  p.444),  i.e.  mechanisms 
which  are  correlated  in  time  may  underlie  cortical  representations  and 


their  modification.  Such  changes  also  appear  to  underlie  the  recovery  seen 
following  damage  to  the  brain  (Jenkins  &  Merzenich*  1987).  It  has  also 
been  shown  that  extended  use  of  a  monkey  digit,  in  a  discriminative  task 
for  which  reward  is  contingent  upon  behavior,  induces 
reorganization/modification  of  somatosensory  cortical  representation 
(Recanzone,  Jenkins,  Hradek,  &  Merzenich,  1992;  Recanzone,  Mcrzenich, 
Jenkins,  Grajski,  &  Dinse,  1992). 

In  auditory  cortex,  Robertson  and  Irvine  (1989)  have  shown  that 
frequency  organization  in  guinea  pig  reorganizes  following  partial 
unilateral  deafness.  In  cats,  the  cochleotopic  representation  in  primary 
auditory  cortex  (AI)  was  extensively  reorganized  following  neonatal, 
bilateral  high  frequency  cochlear  damage  and  hearing  loss  (Harrison, 
Nagasawa,  Smith,  Staton,  &  Mount,  1991).  In  owl  monkeys.  Recanzone, 
Schreiner,  and  Merzenich  (1993)  observed  plasticity  in  the  frequency 
representation  of  primary  auditory  cortex  following  discrimination 
training.  An  increase  in  the  cortical  area  of  representation  of  those 
frequencies  which  were  behaviorally  relevant  to  the  monkeys  was 
observed.  The  sharpness  of  tuning  was  increased,  as  was  the  latency  of  the 
cortical  responses. 

Thus,  auditory  cortical  representation  of  acoustic  stimuli  is  not  static, 
but  changes  with  changing  relevance  of  the  stimuli.  This  dynamic 
adjustment  of  cortical  response  appears  to  underlie  the  learning  process,  as 
well  as  recovery  following  injury. 

Rationale 

Thus,  the  perception  of  vowels  involves  the  processing  of  peaks  of 
spectral  energy,  the  frexjuencies  of  which  are  located  at  various  distances  or 
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spacings.  It  was  decided  to  create  a  vowel-like  "ripple"  stimulus,  which 
could  be  easily  manipulated  to  allow  the  presentation  of  various  parameters 
of  the  basic  stimulus.  The  main  parameter  of  interest  was  to  be  the  peak 
spacing  or  ripple  density.  Holding  the  ripple  density  constant,  in  each  test 
condition,  the  phase  of  the  envelope  of  the  stimulus  was  to  be  shifted  so  that 
the  locations  of  the  peaks  changed.  The  precision  with  which  the  peak 
shifts  could  be  determined,  and  the  variation  of  this  precision  across 
several  ripple  densities  was  expected  to  shed  light  upon  the  importance  of 
small  changes  in  formant  location  for  vowel  processing.  An  operant 
conditioning  paradigm  was  employed  to  obtain  a  threshold  for 
discriminability  of  phase  shifts  of  a  ripple  stimulus  in  cats. 

It  was  postulated  that  at  the  primary  auditory  cortical  level,  a  vowel¬ 
like  ripple  spectrum  would  be  represented  on  the  basis  of  the  frequency  of 
the  dominant  peaks  in  conjunction  with  a  representation  of  formant  ratio  or 
peak  spacing  as  processed  by  the  filter  function  (sharpness  of  tuning)  of  the 
driven  neurons.  It  was  further  postulated  that,  as  for  plastic  changes  seen 
in  primate  cortex  consequent  to  behavioral  training,  the  cortical 
representation  of  behavioral  stimuli  would  be  different,  reflecting  more 
efficient  processing  of  these  relevant  stimuli,  in  the  cortex  of  trained 
animals  as  compared  to  that  in  untrained  animals. 
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METHODS 


Eleclrophysiological  recording  from  primary  auditory  cortex  was 
performed  in  six  trained  cats,  as  well  as  in  several  normal  un-lrained  cats. 
The  cats  were  injected  intra-muscularly  with  acetylpromazine  maleate  (.09 
mg/Kg)  and  ketamine  hydrochloride  (10  mg/kg).  When  a  cat  was 
sufficiently  sedated,  it  was  shaved  along  the  inside  of  the  two  forelimbs 
(for  IV  cannulation),  along  the  throat  area  (for  insertion  of  tracheal  tube), 
and  over  the  posterior  and  lateral  area  of  the  head  (for  temporal  cortex 
exposure).  Venous  cannulation  was  performed;  anesthesia  was  induced  with 
an  initial  dose  of  pentobarbital  sodium  (30  mg/kg).  Animals  were 
maintained  at  a  surgical  level  of  anesthesia  with  a  continuous  infusion  of 
pentobarbital  sodium  (2  mg  •  Kg"^  •  h'l)  in  lactated  Ringer  solution 
(infusion  volume:  3.5  ml/h)  and,  if  necessary,  with  supplementary 
intravenous  injections  of  pentobarbital  sodium.  The  state  of  anesthesia  was 
monitored  and  the  rate  of  infusion  was  adjusted  to  maintain  an  a  reflexive 
state.  The  cats  were  also  given  dexamethasonc  sodium  phosphate  (0.14 
mg/Kg  im)  to  prevent  brain  edema  and  atropine  sulfate  (1  mg  im)  to  reduce 
salivation.  A  tracheotomy  was  performed  and  a  tracheal  tube  inserted  to 
ease  breathing  and  to  reduce  breathing  noises.  The  temperature  of  the 
animal  was  monitored  with  a  rectal  temperature  probe  and  maintained  at 
37.5  C  with  the  use  of  a  heating  water  blanket  with  feedback  control.  The 
EKG  was  continuously  displayed  on  an  oscilloscope  and  amplified  through 
a  loudspeaker. 

The  cat's  head  was  placed  in  a  standard  rigid  mouth-bar  headholder, 
an  incision  made  in  the  scalp  down  the  center  from  forehead  to  occiput, 
the  right  temporalis  muscle  retracted,  and  the  right  cortical  surface  (right 


side  for  convenience  due  to  stimulation  and  recording  set-up)  exposed 
by  drilling  through  the  skull  in  the  region  of  the  middle  ectosylvian  gyrus. 
The  dura  overlying  the  middle  ectosylvian  gyrus  was  removed,  the  exposed 
cortex  covered  with  silicone  oil,  and  a  video  image  of  the  surface 
vasculature  was  obtained  with  a  CCD  camera,  an  image  capture  board 
(Data  Translation  DT2255)  and  capture  software  (Image  1.4,  NSCA). 
Electrode  f>enetration  sites  were  marked  on  the  image  of  the  cortical 
surface  in  a  display  program  (Canvas,  Deneba). 

Experiments  were  conducted  in  a  sound-shielded  room  (lAC). 
Auditory  stimuli  were  presented  via  calibrated  headphones  (STAX  54) 
enclosed  in  small  chambers  that  were  connected  to  sound  delivery  tubes 
placed  into  the  acoustic  meati  (Sokolich,  US  Patent  4,251,686;  1981).  The 
sound  delivery  system  was  calibrated  with  a  sound  level  meter  (Bruel  & 
Kjaer)  and  a  waveform  analyser  (General  Radio  152 1-B).  Auditory 
stimuli  were  digitally  generated  by  a  signal  processing  computer 
O'MS32010)  and  converted  into  analog  signal  by  a  16-bit  digita!-to-analog 
converter  running  at  a  60  kHz  sampling  rate.  Additional  attenuation  was 
provided  by  a  pair  of  passive  attenuators  (Hewlett  Packard). 

For  each  recording  site  responses  were  firstly  recorded  to  at  least 
675  different  tone  bursts.  Tone  bursts  were  presented  in  a  pseudorandom 
sequence  of  different  frequency-level  combinations  selected  from  15  level 
values  and  45  frequency  values.  From  the  responses  to  all  stimuli, 
frequency  response  areas  (FRAs)  were  reconstructed.  Steps  between  levels 
were  5  dB,  resulting  in  a  sampled  dynamic  range  of  75  dB.  The  frequency 
range  covered  by  the  45  frequency  steps  was  geometrically  centered 
around  the  estimated  cf  of  the  recording  site  and  covered  3  to  5  octaves, 
depending  on  the  estimated  width  of  the  FRA.  Stimulus  frequencies  were 


equidistant  on  a  logarithmic  frequency  scale.  Following  the  presentation  of 
pure  tones  and  the  generation  of  the  FRA,  ripple  stimuli  with 
characteristics  similar  to  the  stimuli  used  in  animal  behavio'^al  testing  were 
created  and  presented:  bandwidth  was  3  octaves,  fundamental  frequency 
was  approximately  69  Hz,  ripple  densities  (10  to  14  values)  of  from  0.5  to 
8  ripples/octave  were  presented,  envelope  phase  shifts  (10  values)  of  from 
0  to  180  degrees  were  presented.  See  figure  3. 


Amplitude  (dB) 


0  degree  phase  shift 
180  degree  phase  shift 
90  degree  phase  shift 


Frequency  (kHz) 


Figure  3.  Ripple  Stimuli^:;,  showing  0  degree,  180  degree,  and 
''»0  degree  phase  shift.  Corresponding  frequency  locations  are 
sh  >vt'n:  3.675,  2.6,  and  3.1  kHz.  Frequencies  lo  the  right 
rcpreseitt  phase  shifts  of  negative  90  and  negative  180  degrees, 
at  locations  of  4.3  and  5  1  kHz.  Ripple  density  =  1 
ripplc/octave. 
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Primary  auditory  cortex  (AI)  was  identified  using  cf  information, 

■’  ‘ ' 

1°  • 

a.  .« 

high  frequencies  (20  kHz)  being  represented  anteriorly  and  low  frequencies 

(2  -  4  kHz)  posteriorly  (Merzenich  et  al.,  1975)  and  using  sulcal/gyra! 

landmarks.  Multiple  unit  recording  was  performed  using  parylene  coated 
tungsten  electrodes  with  impedances  of  1.0  to  2.0  MOhm  at  1  kHz;  in  a  few 
cases,  single  unit  responses  were  recorded  with  glass-coated  tungsten 

jf  ;  ^ 

e'''f’trodcs  (3-4  MOhm  at  1  kHz).  The  electrode  was  introduced  into  the 

•'  i 

cortex  approximately  orthogonal  to  the  surface  (as  viewed  through  a  2^iss 

operating  microscope);  penetrations  were  parallel  to  each  other.  The 

‘.A  •> 

electrode  was  advanced  through  the  cortical  layers  to  a  depth  of  600  -  1000 

'!•  ' 

/rm,  corresponding  to  cortical  layers  III  and  IV,  using  a  hydraulic 

• 

h  ■ 

microdrive  (KopO  remotely  controlled  by  a  stepping  motor.  Activity  of 

f  ■' 

small  groups  of  neurons  was  amplified,  band-pass  filtered  (1-10  kHz,  12 

t ' 

dB  octave),  and  monitored  on  an  oscilloscope  and  an  audio  monitor.  The 

i'  .-'  ' 

discriminator  (BAK  DIS-1)  level  was  set  to  exclude  evoked  potentials  and 

•'i  •  '■* 

to  accept  events  that  resembled  action  potentials  of  an  amplitude  at  least 

'X  ' 

50%  above  the  background  signal.  The  number  of  events  per  presentation 

and  the  arrival  time  of  the  first  event  after  the  onset  of  the  tone  bursts  were 

'Jf  '  ■ 

S'-. 

recorded  and  stored  in  a  computer  (DEC  1 1/73).  The  recording  window 

•.  -■*. ' 

^  • 

had  a  duration  of  50  ms,  corresponding  to  the  stimulus  duration. 

Poststimulus  time  histograms  (PSTHs)  were  constructed  in  response  to  the 

r  '■  ■ 
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A  general  response  profile  of  cells  was  assessed  including 

spontaneous  discharge  rate,  characteristic  frequency  (cf)  (the  stimulus 

,./• 

frequency  with  the  lowest  sound-pressure  level  necessary  to  evoke  neuronal 

br  '  *  »  « 

activity),  threshold  (the  lowest  level  evoking  activity  in  the  FRA), 

rC"' 

bandwidth  approximately  10  dB  above  threshold  (to  compute  Q-10,  which 

'.'■X 
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is  cf  divided  by  bandwidth  10  dB  above  threshold),  bandwidth  40  dB  above 
threshold  (to  compute  Q^,  which  is  cf  divided  by  bandwidth  40  dB  above 
threshold),  and  in  some  cases  binaural  interaction  class.  Follov/ing 
electrophysiological  testing  animals  were  sacrificed  with  an  overdose  of 
pentobarbital  sodium;  bilateral  thoracotomy  was  performed. 
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RESULTS 


Several  (eighteen)  normal  "untrained"  cats  provided  data  for  this 
thesis.  The  recording  protocol  changed  slightly  over  time  as  information 
about  neural  responses  was  gathered.  Thus,  not  all  untrained  cat  data  are 
included  in  all  analyses.  Three  cats,  #218,  293,  and  176  were  behaviorally 
trained  and  tested  on  a  stimulus  with  a  ripple  density  of  1  ripple/octave; 
they  underwent  recording  at  the  end  of  behavioral  testing.  Three  other 
cats  (//92-1767,  CD184,  and  92-1688)  were  behaviorally  trained  and  tested 
on  a  stimulus  with  a  ripple  density  of  two  ripples/octave.  Two  "exposed" 
cats,  #122,  and  40764,  i.e.  cats  that  had  been  placed  on  the  training 
procedure  but  had  been  unable  to  learn  the  discrimination  paradigm,  also 
provided  electrophysiological  data. 

Ripple  Density  Representation  in  Cortex 

Ripple  stimuli  of  various  densities  were  presented  during 
electrophysiological  recording  to  both  trained  and  untrained  cats.  In  this 
section  of  the  results,  the  spike  counts  recorded  in  the  post-stimulus 
histograms  were  normalized,  that  is,  for  the  responses  to  the  ripple 
stimulus,  the  best  response  was  designated  as  100%  and  the  smallest  0;  all 
other  responses  were  computed  as  a  percentage  between  0  and  100.  In  this 
way,  variation  in  the  responses  to  the  ripple  could  be  more  easily  observed, 
and  the  responses  could  be  compared  to  one  another,  and  across  animals. 


The  "neural  response"  was  defined  as  ilie  normalized  spike  count  averaged 
across  all  penetrations,  and  described  as  a  function  of  the  ripple  density  of 
the  stimulus.  For  electrophysiological  response  recording  to  different 
ripple  densities,  the  ripple  stimulus  was  centered  on  the  cf  of  the  multi-unit 
penetration,  then  various  ripple  densities  (0.5  to  8  iripples/octave)  were 
presented. 


The  electrophysiological  results  for  cats  trained  with  a  ripple 
density  of  1  ripple/octave  will  be  discussed  initially;  untrained  data  will 
then  be  presented;  the  results  from  these  trained  and  the  untrained  animals 
will  then  be  compared;  finally  the  data  from  all  6  trained  animals  will  be 
compared  with  the  results  from  untrained  animals. 

Figure  4a  shows  spike  counts  averaged  across  all  penetrations  as  a 
function  of  ripple  density  for  the  first  trained  cat,  #218.  The  total  number 
of  spikes,  recorded  over  thirty  repetitions  of  the  ripple  stimulus 
presentation,  during  the  30  ms  following  response  onset,  across  all 
penetrations,  for  each  ripple  density,  were  added  and  averaged.  The  best 
response  occurred  to  a  ripple  density  of  1  ripple/octave.  This  function  may 
be  described  as  a  cosine  ripple  transfer  function  (cosine,  because  it 
represents  the  stimulus  presented  only  at  a  phase  of  0  degrees;  a  complete 
description  of  the  ripple  transfer  function  also  requires  the  response  to  the 
stimulus  at  phase  90  degrees).  The  bandwidth  halfway  between  the  peak 
and  the  lowest  value  within  the  function  may  be  measured.  In  this  case,  the 
band'..'idth  50%  down  from  the  peak  is  2.33  ripples/octave. 

Figure  4b  illustrates  the  neural  response  as  a  function  of  ripple 
density  for  the  second  trained  cat,  #293.  In  this  animal  the  best  response 
was  at  a  ripple  density  of  0.33  ripples/octave.  The  bandwidth  50%  down 
from  the  peak  was  1.0  ripple/octave. 

Figure  4<'  illustrates  the  neural  response  as  a  function  of  ripple 
density  for  the  third  trained  cat,  #176.  The  best  response  was  at  a  ripple 
density  of  1  ripple/octavc  (0.33  ripple  density  is  quite  close;  the  peak  at 
4.66  was  the  neural  response  to  the  ripple  density  presented  first  in  the 
series).  The  bandwidth  50%  down  from  the  peak  was  1.66  ripples/octavc. 
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Figure  4d  illustrates  the  neural  response  as  a  function  of  ripple 
density  averaged  across  these  three  trained  cats.  The  best  response  was  at  a 
ripple  density  of  1  ripple/octave.  The  bandwidth  50%  down  from  the 
peak  was  1.75  ripples/octave. 
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Figure  4  a)  Nesirai  response  as  a  function  of  ripple  density  for 
Cat  #218,  and  1>)  Neural  response  as  a  function  of  ripple  density 
for  Cat  nn. 
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Figure  4  c)  Neura!  Response  as  a  function  of  ripple  density  for 
Cat  #176,  and  d)  Neural  Response  as  a  function  of  ripple  density 
averaged  across  all  three  Tri..ined  Cats. 
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Figure  5  a,b,c,d,e,  and  f  presents  similar  functions  for  five  untrained 
cats  (424,  188,  1681 ,  1673,  and  92-1700)  and  for  the  average  of  the  five 
untrained  cats. 

Table  1  presents  the  peak  responses,  bandwidths  50%  down  from  the 
peak,  number  of  microelectrode  penetrations  ('units'),  and  range  of 
characteristic  frequencies  (cfs)  included  in  the  neural  response  for  each  of 
the  three  trained  cats  and  for  the  five  un-trained  cats. 


cat  # 

training 

status 

r.d. 

peak 

response 

bandwidth 

50% 

down 

number 

units 

range 
of  cfs 
(kHz) 

218 

Trained 

1 

2.33 

45 

2.3  to  9 

293 

Trained 

0.33 

1.2 

77 

2.3  to  11,8 

176 

Trained 

1 

1.66 

50 

2.7  to  11.7 

681 

Untrained 

0.66 

1.0 

15 

1.9  to  11.5 

1673 

Untrained 

0.33 

1.0 

17 

2.6  to  7.2 

92-1700 

Untrained 

0.33 

0.5 

11 

4.3  to  13.4 

424 

Untrained 

0.66 

1.33 

18 

1.5  to  7.9 

188 

Untrained 

0.33 

1,0 

26 

I.l  to  9.4 

Table  1.  Bandwidths  at  Best  Ripple  Response  for  Trained  and 
Untrained  Cats 


Performing  a  Mann-Whitney  U  two-group  un-paired  comparison  of 
the  trained  animals'  bandwidths  to  those  of  the  five  untrained  cats  reported 
a  significant  difference  (p=0.05).  That  is.  the  bandwidth  50%  down  from 
the  best  response  to  a  particular  ripple  density  was  wider  in  trained  cats 
than  in  untrained  cals. 
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Figure  Sa)  Neural  response  as  a  function  of  ripple  density  for 
Cat  #681,  and  b)  Neural  response  as  a  function  of  ripple  density 
for  Cat  #1673. 
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Figure  5  c)  Neural  response  as  a  function  of  ripple  density  for 
Cat  #92-1700,  and  d)  Neural  response  as  a  function  of  tipple 
density  for  Cat  #424. 
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ripple  density 

Figure  5  e)  Neural  Response  as  a  function  of  ripple  density  for 
Cat  #188,  and  f)  Neural  Response  as  a  function  of  ripple  density 
averaged  across  five  Untrained  Cats. 


Figure  6  presents  a  comparison  of  the  neural  response  as  a  function 
of  ripple  density  for  the  average  of  three  trained  cats  and  the  average  of 
five  untrained  cats.  Performing  a  Wilcoxon  Signed-Rank  two-group 
paired  comparison  reported  a  difference  significant  at  a  level  of  P--O.OOI. 
Thus,  the  response  to  ripple  densities  was  different  in  the  trained  cats  as 
compared  to  the  untrained  cats.  In  trained  cats,  the  best  response  was 
recorded  in  response  to  a  higher  ripple  density  than  the  best  response  in 
untrained  cats.  Overall,  i.e.,  across  all  ripple  densities  presented,  the 
responses  were  on  average  higher  in  the  trained  cats,  except  for  the  best 
response  ripple  density  in  untrained  cats. 

Thus,  untrained  cats  showed  the  highest  response  to  ripple  densities 
of  either  0.66  or  0.33  ripples/octave,  with  the  mean  best  ripple  response 
being  at  a  ripple  density  of  0.33  ripples/octave.  For  three  cats  trained  on  a 
ripple  density  of  1  ripple/octave,  the  highest  response  was  at  either  1  or 
0.33  ripples/octave,  with  the  mean  best  ripple  response  being  at  a  ripple 
density  of  1  ripple/octave.  The  bandwidth  of  the  ripple  transfer  function 
50%  down  from  the  peak  response  was  wider  in  these  trained  cats  as 
compared  to  untrained  cats. 

Figure  7  illustrates  the  ripple  transfer  function  for  the  cats  trained 
on  a  ripple  density  of  1  ripple/octave,  as  above,  as  well  as  the  three  cats 
trained  on  a  ripple  density  of  2  ripples/octave,  compared  to  the  ripple 
transfer  function  for  the  average  re.sponse  of  8  untrained  cats. 
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Figure  6.  Comparison  of  Ripple  Transfer  Function  for  Trained 
Cats  'VS  Untrained  Cats 
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Figure  7.  Comparison  of  Ripple  Transfer  Function  for  Trained 
Cats  (rd  =  1,  AND  rd  =  2)  vs.  Untrained  Cats 
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Tuning 

The  sharpness  of  tuning  of  the  excitatory  response  areas  of  the 
penetrations  for  untrained  and  trained  cats  were  measured  and  compared. 
The  Q-40  value  is  calculated  by  dividing  the  characteristic  frequency  (cf)  at 
the  particular  penetration  by  the  bandwidth  40  dB  above  the  threshold.  Q- 
40  values  for  only  penetrations  with  cfs  below  1 1  kHz  were  included  in  this 
analysis. 

Figure  8  displays  the  regression  of  the  Q-40  values  against  the  cf  at 
each  penetration  for  the  untrained  cats.  Figure  9  displays  the  equivalent 
regression  for  the  trained  cats.  In  these  figures,  there  are  139  values  for 
the  three  trained  cats,  and  155  values  for  seven  un-trained  cats.  In  both 
cases,  there  is  no  relationship  between  the  Q-40  value  and  the  cf  of  the 
penetration,  indicating  that  the  sharpness  of  tuning  is  independent  of  the 
frequency  of  the  penetration.  . 
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Figure  9.  Regression  of  the  Q-40  values  against  the  cf  at  each 
penetration  for  the  trained  cats. 
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Figure  10  illustrates  the  distribution  of  the  same  Q-40  values  for  the 
three  trained  cats  and  for  seven  untrained  cats,  including  139  values 
for  the  three  trained  cats,  and  155  Q-40  values  for  seven  un-trained  cats. 

A  Mann-Whitney  U  two-group  un-paired  comparison  was  performed;  the 
two  groups  of  animals,  trained  and  untrained  had  different  Q-40  values  at  a 
significance  level  of  p=0.004.  The  mean  Q-40  value  for  the  untrained  cats 
was  1.93,  with  standard  deviation  of  1.4;  for  the  trained  cats  the  mean  Q- 
40  value  was  1.91,  with  standard  deviation  of  2.2. 


Q  40  values 
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Figure  10.  Distribution  of  Q-40  values  for  trained  and 
untrained  cats. 


Figure  11.  Distribution  of  Q-40  values  for  three  trained  cats. 
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Figure  1 1  illustrates  the  distribution  of  Q-40  values  for  the  three 
trained  cats.  The  cat  with  the  best  (lowest)  ripple  envelope  phase  shift 
threshold  (18  degrees),  #293,  had  the  highest  Q-40  values;  the  cat  with  a 
threshold  of  33  degrees,  #176,  had  the  next  highest  Q-40  values;  and  the  cat 
with  the  poorest  threshold  (80  degrees)  had  the  lowest  Q-40  values.  The 
average  QAO  values  and  the  behavioral  thresholds  for  each  of  the  trained 
cats  are  shown  in  the  table  below. 


Cat 

Q-40 

behavioral 

threshold 
(la  degrees) 

218 

1.37 

80 

293 

2.09 

18 

176 

2.36 

33 

Table  2.  Q-40  values  and  behavioral  thresholds  for  trained  cats 
The  average  Q*40  values  for  the  two  "exposed"  cats,  #122  and  40764,  i.e. 
cats  which  had  been  exposed  to  the  ripple  stimulus  for  approximately  two 
and  four  months  respectively,  but  which  were  unable  to  learn  the  single 
speaker  procedure,  were  1.75  and  1.61.  The  correlation  between  the 
average  Q-40  value  for  each  of  the  three  trained  cats  and  its  behavioral 
threshold  was  -.88,  with  an  R-squared  value  of  .77.  A  regression  analysis 
of  the  trained  cats'  Q40  values  and  their  behavioral  thresholds  is  shown  in 
Figure  12,  Due  to  the  small  sample  size,  the  data  do  not  allow  a 
statistically  secure  p-value  conclusion;  however  the  R-squared  value  of 
0.77  indicates  that  77%  of  the  variance  in  the  data  is  explained  by  the 
relationship  between  the  sharpness  of  tuning  and  the  behavioral  phase  shift 
thresholds. 


Q^O  values 


f  igure  12.  Kegression  analysis  for  the  >  .'alned  cats*  Q-40  values 
and  their  behavioral  thresholds. 
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Schreiner  &  Mendelson  (1990)  observ'cd  a  frequency-independent 
maximum  in  sharpness  of  tuning  for  both  Q-10  and  Q-40  values  located 
near  the  center  of  the  dorsoventral  extent  of  AI.  A  representative  pseudo- 
three-dimensional  projection  of  the  spatial  distiibution  of  Q-40  values  in 
AI,  taken  from  their  paper,  is  shown  in  Figure  13.  In  order  to  be  certain 
that  the  Q-40  values  of  the  trained  cats  were  not  penetrations  recorded  only 
at  the  very  central  portion  of  AI,  three-dimensional  representations  of  the 
spatial  distribution  of  Q-40  values  for  the  three  trained  cats  were  created. 
They  are  shown  in  Figure  14.  The  highest  Q-40  values  do  not  appear  to  be 
concentrated  in  the  central  portion  of  AI.  This  observation  lends  support 
to  the  idea  that  the  training  of  the  animals  affected  the  shatpness  of  tuning, 
as  measured  by  the  Q-40  values. 


Figure  13.  A  representative  pseudo-three-dimensional 
projection  of  the  spatial  distribution  of  Q-40  values  in  A1  (from 
Schreiner  &  Mendelson,  1990).  A  maximum  is  seen  in  the 
center  of  the  dorso-ventral  extent. 


Figure  14  a).  Three-dimensional  representation  of  the  spatial 
distribution  of  Q-40  values  for  trained  cat  #218. 
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Figure  14  b)  Three-dimensiouai  representation  of  the  spatial 
distribution  of  Q-40  values  for  trained  cat  #293. 


Figure  14  c)  Three-dimensional  representation  of  the  spatial 
distribution  of  Q-40  values  for  trained  cat  #176. 
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DISCUSSION  &  CONCLUSIONS 

The  results  of  this  research  have  several  implications.  The  breadth 
of  the  subject  matter  bespeaks  the  need  for  an  integration  of  several  lines  of 
investigation,  to  most  effectively  answer  questions  about  brain  function  and 
perception  and  language. 

Cortical  Representation 

Ripple  densities  were  represented  differently  in  the  auditory  cortices 
of  the  trained  cats  as  compared  to  untrained  cats.  The  peak  of  the  ripple 
transfer  function  was  shifted  slightly  toward  higher  ripple  densities  (1 
ripple/octave  compared  to  0.33).  The  best  ripple  density  peak  response  for 
untrained  animals  was  somewhat  lower  than  the  mean  best  ripple  density 
reported  in  Schreiner  et  al.  (1993):  0.33  as  compared  to  1  to  2 
ripples  'octave.  That  may  be  partly  due  to  the  fact  that  most  of  the 
Schreiner  et  al.  (1993)  data  were  collected  as  single  units,  whereas  the 
majority  of  the  data  reported  for  the  untrained  cats  in  the  present  study  was 
collected  as  the  response  of  small  clusters  of  units  (2-6  neurons),  which 
may  be  less  precise. 

In  hindsight,  it  would  have  been  more  telling  to  have  trained  the 
animals  on  a  ripple  density  other  than  1  ripple/octavc,  as  untrained  cats 
already  show  their  best  response  to  ripple  densUies  in  the  range  of  about 
.33  to  approximately  1  ripple/octave.  However,it  appears  to  be  more 
difficult  for  cats  to  leam  the  discrimination  at  higher  ripple  densities;  it 
was  not  possible  to  obtain  a  behavioral  threshold  at  higher  ripple  densities. 

The  bandwidth  50%  down  from  the  peak  was  wider  in  trained  cats 
than  in  untrained  cats  .  A  broadening  of  the  filter  function  implies  that  a 
larger  number  of  spatial  frequencies  (ripple  densities)  were  proce.ssed 
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more  efficiently  by  the  trained  cats  as  compared  to  the  untrained  cats. 

Since  three  trained  cats  were  only  trained  on  a  ripple  density  of  1 
ripple/octave,  this  increased  processing  efficiency  was  not  due  to  a  direct 
learning  effect.  What  may  have  happened  was  an  improvement  in  peak 
spacing  discrimination  that  generalized  somewhat  so  that  spacings  close  to  1 
ripple/octave,  though  not  challenged,  were  affected  in  a  facilitory  manner. 

Q-40  values  over  all  penetrations  were  higher  in  trained  cats  as 
compared  to  untrained  cats,  indicating  a  general  sharpening  of  frequency 
response  tuning.  The  behavioral  threshold  was  correlated  (-.87)  with  Q-40 
values,  indicating  that  the  sharpening  in  tuning  was  related  to  the 
proficiency  of  auditory  discrimination.  Recanzone  et  al.  (1993)  reported  a 
sharpening  of  tuning  for  behaviorally  relevant  frequeiicies  in  monkeys 
trained  to  discriminate  small  differences  in  the  frequencies  of  sequentially 
presented  tonal  stimuli.  In  this  study,  there  were  not  enough  penetrations 
with  cf  at  the  center  frequency  of  the  behavioral  stimulus  to  compare  the 
tuning  there  to  the  tuning  at  other  frequency  locations,  nor  would  we 
expect  a  simple  frequency  contrast  to  code  this  discrimination.  The  fact 
that  the  overall  tuning  was  sharpened  may  indicate  a  more  global  effect  for 
this  complex  stimulus. 

In  the  Recanzone  study,  the  sharpness  of  tuning  was  not  correlated 
with  behavioral  performance;  however  the  cortical  representation  of  the 
behaviorally  relevant  frequencies  was  increased  and  its  area  was  correlated 
with  behavioral  performance.  In  the  current  study,  a  best  ripple  density 
response  at  each  penetration  was  measured,  and  an  increase  in 
representation  of  best  ripple  density  of  1  ripple/octave  was  not  consistently 
seen  in  all  trained  animals  as  compared  to  untrained  animals.  However, 
although  there  is  some  indication  of  a  spatial  organization  for  ripple 


density  representation  orthogonal  to  the  frequency  axis  (Schreiner  ct  al., 
1993),  this  representation  is  not  presently  as  circumscribed  as  is  the 
representation  for  pure  tones.  A  larger  n  of  trained  animals  might  reveal  a 
reliable  change  in  this  representation. 

Language  learning 

Kuhl,  Williams,  Lacerda,  Stevens,  &  Lindblom  (199?)  observed  that 
exposure  to  a  specific  language  in  the  first  six  months  of  life  alters  infants' 
phonetic  perception.  Infants  in  America  and  Sweden  were  tested  with  both 
native-  and  foreign-language  vowel  sounds.  "Infants  from  both  countries 
exhibited  a  language-specific  pattern  of  phonetic  perception"  (p.606).  As 
pointed  out,  this  perceptual  organizing  occurs  well  before  the  age  at  which 
children  begin  to  acquire  word  meanings,  i.e.  well  before  the  acquisition  of 
language.  It  was  suggested  that  "linguistic  experience  shrinks  the 
perceptual  distance  around  a  native-language  prototype,  in  relation  to  a 
nonprototype,  causing  the  prototype  to  perceptually  assimilate  similar 
sounds"  (p.6CS).  Tiius  phonetic  prototypes  were  conjectured  to  be 
"fundamental  perceptual-cognitive  building  blocks"  (p.608). 

The  data  in  this  report  support  the  idea  that  certain  basic  auditoty 
processing  capabilities  are  operative  in  humans  and  in  other  mammals  as 
well,  specifically  at  least  in  cals,  and  that  e.xposure  to  or  experience  with 
the  stimuli  refines  the  processing  capability  and  the  cortical  representation 
of  that  stimulus.  In  the  same  way  that  a  perceptual  organization  was  set  up 
in  the  brains  of  the  six-month-old  infants,  so  categorization  and  learning 
occur  during  the  acquisition  of  a  first  or  second  language.  Learning 
occurred  in  these  animals  as  they  performed  the  auditory  discrimination, 
and  their  brain  representations  of  these  behaviorally  important  stimuli 


were  changed  in  parallel.  It  seems  plausible  that,  at  birth,  certain  basic 
auditory  perceptual  capabilities  are  present,  and  that  with  linguistic 
exposure,  certain  contrasts  are  sharpened  and  certain  other  unused  phonetic 
contrasts  drop  out  of  the  discriminative  repertoire.  The  cortical 
organization  thus  dynamically  represents  those  stimuli  to  which  the 
human/animLj.  has  been  exposed  and  which  are  informationally  and 
biologically  relevant  to  that  creature. 

Evolution  of  linguistic  abilities 

Clearly  humans  are  more  "intelligent"  than  cats.  If  in  doubt,  try 
training  a  cat  on  an  operant  discrimination  procedure.  Cats  were  able  to 
learn  the  paradigm,  however,  and  showed  quite  fine  resolution  abilities  (at 
least  two  out  of  three  cats  did),  but  there  was  variability  in  performance, 
and  two  other  cats  were  unable  to  learn  the  discrimination  procedure. 
Thresholds  for  the  two  adept  cats  were  about  3-4  times  higher  than 
humans  for  the  equivalent  ripple  density.  Their  intelligence  as  well  as  their 
perceptual  discrimination  abilities  for  detecting  envelope  phase  shifts  arc  at 
a  lower  level  but  are  on  the  same  continuum  as  humans. 

Watson  (1991)  has  recently  reported  significant  relationships 
between  simple  sensory,  cognitive,  and  motor  abilities  and  psychometric 
intelligence.  She  reminds  us  that  Spearman  (1904)  and  others  reported 
relationships  between  pitch  discrimination  and  intelligence.  The  suggestion 
is  that  working  memory  is  a  limited-capacity  system,  and  that  therefore 
individuals  who  can  process  information  rapidly  will  have  more  working 
memory  available  to  process  new  information.  Thus,  the  sensory  ability  to 
perceive  differences  and  make  discriminations  can  be  considered  part  of  a 
processing  capacity.  Learning  calls  upon  this  information-processing 
ability. 


98 


The  innate  abilities  with  which  humans  arc  bom  interact  with 
experiential  input,  especially  during  development  but  later  as  well,  to 
produce  a  cerebral  organization  which  reflects  that  interaction.  The  huge 
amount  of  neural  modification  that  occurs  within  the  brain  during 
development  allows  us  to  “make  sense  of  the  world”.  As  Edclman  (1987) 
says:  ”it  is  difficult  to  imagine  the  world  as  it  is  presented  to  a  newborn 
organism.. .the  environment  presented  [to  such  an  organism]  is  inherently 
ambiguous;  even  to  animals  eventually  capable  of  speech  such  as  ourselves, 
the  world  is  initially  an  unlabeled  place”  (p.3).  Adaptive  behavior  requires 
initial  categorization  of  salient  aspects  of  the  environment  so  that  learning 
can  occur  (p.4).  Thus,  perceptual  abilities  are  very  important  in  order  to 
form  concepts.  All  animals  are  bom  with  certain  basic  sensory  and 
discriminative  capacities;  all  must  interact  successfully  with  their 
environment  in  order  to  survive.  This  capacity  for  learning  and  for 
change  has  allowed  Homo  sapiens  to  evolve.  In  turn,  Homo  sapiens  has 
developed  speech  and  language,  culture  and  community.  Our  capacity  for 
speech  has  advanced  our  development  of  thought,  which  has  changed  the 
world.  These  faculties  have  evolved  over  many  eons  of  time  due  to  the 
mechanism  of  neural  adaptation.  How  this  cortical  change  occurs  is 
unknown.  Evidence  that  it  occurs  is  amazing  in  itself. 
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DISCRIMINATION  OF  VOWEL-LIKE  STIMULI  IN  THE  CAT 

M.  Diane  Keeling  William  M.  Jenkins,  fmd  Christoph  E.  Schreiner,  Coleman 
Memorial  Laboratory,  W.M.  Keck  Center  for  integrative  Neuroscience. 
University  of  California,  San  Francisco;  San  Francisco,  CA  94143-0732. 

*  now  at  Psychology  Dept.,  Box  5050,  University  of  New  Brunswick,  Saint 
John,  Saint  John,  N.B.  L2K  2E2,  Canada. 

Abstract 

The  objective  of  this  study  was  to  investigate  the  perception  of 
vowel-like  stimuli  by  measuring  the  ability  of  cats  to  detect  changes  in 
frequency  locations.  Three  cats  were  trained  and  tested  for  their  ability  to 
discriminate  frequency  peak  shifts  of  a  broad-band  spectrally-modulated 
harmonic  complex,  centered  at  3.675  kHz,  with  a  fundan^ental  frequency 
of  69  Hz,  in  a  2-Altemative  Forced  Choice  procedure.  The  time  required 
to  train  to  the  procedure  varied  from  3  weeks  to  2  months.  The  threshold 
(75%  correct)  was  lower  for  all  three  cats  at  the  end  of  testing,  as 
compared  to  the  threshold  at  the  beginning  of  training.  Performance 
improvements  were  continuous  and  progressive.  The  behavioral  thresholds 
correlated  with  the  electrophysiologically-measured  Q-40  values  of  overall 
neural  responses  (described  in  a  subsequent  paper),  indicating  that  a 
sharpening  in  tuning  was  related  to  the  degree  of  proficiency  in  the 
auditory  discrimination.  The  data  imply  that  language  acquisition  involves 
dynamic  cortical  reorganization  of  speech  feature  representations  and  that 
human  linguistic  capabilities  evolved  from  basic  mammalian  auditory 
processing  capacities. 


It  is  generally  believe<J  that  the  vocal  and  auditory  systems  have  co¬ 
evolved  so  that  "the  vocal  system  has  adapted  to  produce  sounds  suitable 
for  detection  and  processing  by  the  auditory  system,  and  the  auditory 
system  has  evolved  to  detect  and  process  these  sounds"  (Suga,  1988, 
p.684).  The  development  of  language  in  humans  may  have  taken  advantage 
of  acoustic  parameters  that  accentuate  certain  aural  differences  just  as 
vowels  are  discriminated  using  formants  -  those  frequencies  which  are 
boosted  by  the  particular  configuration  of  the  vocal  tract.  !t  may  also  be 
the  case  that  certain  acoustic  cues  common  to  speech  and  to  other 
biologically  significant  communicative  sounds  are  processed  by  basic 
neural  mechanisms  that  are  similar  in  humans  and  in  other  animals. 

Partly  because  of  the  complexity  of  speech  perception,  Liberman, 
Cooper,  Shankweiler,  &  Studdert- Kennedy,  (1967)  and  others  (c.g. 
Chomsky,  1972)  have  suggested  that  "speech  is  special",  i.c.  that  humans 
have  unique  linguistic  capacities  designed  specifically  for  producing, 
perceiving,  and  understanding  speech  and  language.  Furthermore,  it  has 
been  suggested  that  the  perception  and  production  of  speech  are  intimately 
related  in  that  the  perception  of  speech  is  directly  dependent  upon  neural 
"knowledge"  of  its  production.  In  this  respect,  speech  perception  is 
considered  to  be  qualitatively  different  from  other  forms  of  auditory 
communication. 

Animal  Communication 

There  has  been  considerable  discussion  as  to  whether  communication 
among  animals  is  comparable  to  communication  among  humans.  The 
discussion  has  revolved  around  defining  what  is  meant  by  "communication" 
vs.  "language"  (see  for  example  Tarttcr,  1986,  chapter  8;  and  Liebcrman, 
1984).  Attempts  have  been  made  to  train  animals  (mainly  apes,  gorillas. 
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and  chimpanzees)  to  use  human  language.  These  studies  were  unable 
howevcrv  to  differentiate  between  cognitive  abilities,  that  is,  abilities  to 
form  associations  between  concepts/objccts  and  an  ability  to  form  a  truly 
linguistic  symbolic  representation  of  an  object/idea.  (See  the  following 
references:  Gardner  &  Gardner,  1969;  Patterson,  1978  a,  b;  Premaclc, 
1971;  Rumbaugh,  Gill,  &  von  Glaserfield,  1973;  and  Terrace,  Pettito, 
Sanders,  &  Bever,  1979.) 

Field  data  for  vervet  monkeys  in  their  natural  environment  in  Kenya 
were  presented  by  Seyfarth,  Cheney,  &  Marler  in  1980.  The  animals  gave 
different  alarm  calls  to  different  classes  of  predators  and  reacted  to  play¬ 
backs  in  different  ways,  indicating  that  the  calls  carried  distinctive  semantic 
meanings.  Cheney  &  Seyfarth  (1988)  further  showed  that  monkeys  who 
had  learned  to  ignore  an  unreliable  signaler  (a  monkey  "crying  wolf?) 
also  ignored  an  acoustically  different  but  same  referent  call  from  the  same 
individual.  Such  transfer  did  not  occur,  however,  if  the  identity  of  the 
signaler  or  the  referent  of  the  call  changed.  The  authors  suggest  that 
vervet  monkeys,  "like  humans,  process  information  at  a  semantic,  and  not 
just  an  acoustic  level"  (p.477). 

Discrimination  of  specific  speech  features  was  reported  by  Kuhl  and 
Miller  in  1975,  in  the  chinchilla  (a  mammal  with  auditory  capabilities 
fairly  similar  to  those  of  humans  -  sec  Miller,  1970).  Using  a  shock 


avoidance  procedure,  they  trained  animals  to  respond  differciuiy  lu  a 


variety  of  spoken  /t/  and  /d/  consonant-vowel  syllables.  Once  trained,  the 


animals  generalized  to  other  speakers,  other  vowel  contexts,  ana  to 


computer-synthesized  /ta/  and  /da/  syllables. 


In  another  series  of  investigations,  Kuhl  &  Padden  (1982,  1983) 
tested  monkeys,  using  a  positive  reinforcement  technique,  wherein  the 
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monkey  was  trained  on  a  same-different  task,  i.e.  the  monkey  was 
rewarded  for  one  behavior  when  stimuli  were  the  same,  and  for  another 
when  the  stimuli  were  different.  Voice-onset-time  (VOT),  defined  as  the 
time  between  the  stop  release  burst  and  the  onset  of  vocal  cord  vibration 
(voicing),  and  place-of-articulation  discrimination  were  investigated.  In 
both  cases,  discriminability  was  poor  when  the  stimulus  pairs  were  from 
the  same  phonetic  category  and  good  when  they  were  from  different 
phonetic  categories. 

Other  investigators  have  reported  discrimination  of  speech 
parameters  in  animals.  For  example,  Morse  &  Snowdon  (1975)  reported 
categorical  discrimination  of  speech  sounds  by  rhesus  monkeys  using  the 
cardiac  component  of  the  orienting  response.  The  monkeys  significantly 
differentiated  stimuli  across  phonetic  boundaries,  but  also  discriminated 
within  phonetic  categories,  although  to  a  lesser  degree.  Sinnott,  Beecher, 
Moody,  &  Stebbins  (1976)  concluded  that  monkeys  and  humans  had  similar 
sensory  capacities;  however,  while  humans  showed  a  longer  latency  of 
response  when  both  stimuli  were  within  the  same  phonemic  category, 
monkeys  did  not.  They  argued  that  this  was  support  for  unique  speech 
processing  capacities  in  humans.  Sinnott  &  Adams  (1987)  measured 
difference  limens  to  various  VOT  stimuli  in  monkeys  and  humans  and 
observed  that  again,  monkeys  were  much  less  sensitive  to  speech  cues  than 
were  humans.  Waters  &  Wilson  (1976)  reported  perceptual  boundaries  in 
rhesus  monkeys  that  were  close  to  the  human  boundaries  between  voiced 
and  voiceless  consonants. 

Baru  (1975)  discussed  the  parameters  involved  in  discrimination  of 
vowels  in  the  dog.  Dogs  were  able  to  discriminate  /a/  from  /i/  regardless 
of  fundamental  frequency  (male  voice  vs.  female  voice),  and  discrimination 


of  vowels  appeared  to  rely  on  the  same  cues  as  those  for  humans  (mainly 
the  first  two  formant  frequencies  of  the  signal).  Following  either 
unilateral  or  bilateral  ablation  of  primary  auditory  cortex,  discrimination 
was  not  impaired  except  when  signal  duration  was  30  msec  (as  opposed  to 
75,  150,  or  300  msec). 

Dewson  (1964)  reported  vowel  discrimination  in  the  cat.  Cats  were 
trained  to  discriminate  between  /u/  and  /i/,  but  following  bilateral  ablation 
of  the  ventral  insular-temporal  cortex  (but  not  all  of  primary  auditory 
cortex)  did  not  retain  or  were  unable  to  relearn  the  discrimination.  The 
animals  could  still  discriminate  frequency  and  intensity  differences. 

Vowel  formants 

Vowels  are  characterized  by  their  formants  -  frequencies  which  are 
elevated  in  amplitude  relative  to  other  frequencies  and  which  are  produced 
by  the  natural  resonances  of  the  vocal  tract  activated  during  production  of 
the  vowel  (indicated  as  FI,  F2,  F3,  etc.).  Formant  values  show 
considerable  variability  across  speakers,  however,  and  even  overlap  for 
some  vowels  (Peterson  &  Barney,  1952).  The  variability  across  speakers  is 
due  to  physical  differences  in  vocal  tract  size  but  also  to  a  speaker's  past 
experience,  mainly  his/her  particular  dialectical  background,  but  the  values 
may  vary  within  context  and  even  across  repetitions  as  well. 

It  is  clear  that  in  spile  of  speaker  variation,  listeners  are  somehow 
able  to  adjust  for  this.  Miller  (1989)  reports  that  in  the  late  ISOO's  Richard 
John  Lloyd  formulated  the  formant-ratio  theory  of  the  vowel  stating  that 
"vowel  quality  depends  on  the  intervals  between  the  resonances,  not  their 
absolute  values"  (p.2155).  Miller's  auditory-perceptual  interpretation  of 
the  vowel  is  a  descendant  of  the  formant-ratio  theory  and  proposes  that 


i: : 

'  ■  * 

6 

■  ‘■i . 

T  •  - 

changes  in  spectral  patterns  of  formants  effectively  categorize  vowel 
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sounds. 
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The  ratios  of  the  formants  may  be  an  important  cue.  We  calculated 

the  ratios  of  the  first  and  second,  and  second  and  third  formants;  since 
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frequency  is  represented  on  a  logarithmic  scale  along  the  basilar  membrane 
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within  the  cochlea,  we  computed  the  ratios  of  the  formants  in  units  per 
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octave  (Miller,  1989)  and  hypothesized  that  the  perception  of  variations  in 
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the  position  of  spectral  peaks  is  dependent  on  the  spacing  of  the  peaks. 
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Cat  vocalizations  have  been  analyzed  and  appear  to  contain  spectral 
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peaks  of  energy  similar  to  formants  in  human  vowels  (Brown,  Buchwald, 
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Johnson,  &  Mikolich,  1978).  The  average  fundamental  frequency  for  cats 
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is  within  the  range  of  400-600  Hz  (Shipley.  Carterette,  &  Buchwald,  1990); 
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it  was  also  observed  that  the  frequency  vvith  the  largest  energy  is  usually  a 
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resonance  of  the  vocal  tract.  The  authors  state:  "the  presence  of  formants 

in  cat  vocalizations  and  the  magnitude  of  energy  in  these  frequencies 
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suggests  that  in  processing  conspecific  vocalizations  the  auditory  sy.stem  of 
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the  cat  needs  to  perform  extensive  identification  and  tracking  of  fomiant 

if!/''- 

patterns"  (p.l5). 
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Thus,  it  can  be  concluded  that  the  perception  of  vowels  involves  the 
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processing  of  peaks  of  spectral  energy,  the  frequencies  of  which  are  located 
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at  various  distances  or  spacings.  In  order  to  elucidate  the  precision  of  this 
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p.rocessing,  vowel-like  stimuli  were  presented  to  cats  in  a  2-Alternative 
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Forced  Choice  testing  procedure.  Holding  the  peak  spacing/ripple  density 
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constant,  in  each  test  condition,  the  phase  of  the  envelope  of  the  stimulus 

was  shifted  so  that  the  locations  of  the  peaks  changt-d.  The  precision  with 

which  the  peak  shifts  could  be  determined  was  expected  to  shed  light  upon 
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the  importance  of  small  changes  in  formant  location  for  vowel  processing. 
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MET?IODS 

Three  cats  v/cre  trained  to  detect  shifts  in  the  phase  of  the  stimulus 
envelope.  A  tivo~alternaiive  forced  choice  paradigm  was  used.  All 
procedures  were  approved  by  the  University  Committee  on  Animal 
Research  (protocol  #  A886-05785-03).  The  stimulus  generation  and 
response  tabulation  were  controlled  and  presented  with  the  LabVIEW 
application. 

The  animals  were  food  deprived  to  90-  95%  of  their  pre-fast  weight; 
reward  during  testing  was  diluted  canned  cat  food  (Hill's  Prescription  Diet 
feline  c/d).  The  food  was  dispensed  from  a  large  dose  syringe  using  a 
stepping  motor,  which  pumped  a  small  amount  of  food  onto  a  fiberglass 
plate,  about  6  cm  in  diameter,  which  the  cat  licked.  The  food  reward 
system  was  modelled  after  a  dispensing  system  devised  by  Thompson, 

Porter,  O'Bryan,  Heffner  &  Heffner  (1990).  The  cats  were  weighed  daily 
and  received  a  supplement  of  dry  cat  food  after  testing  in  order  to  obtain 
their  total  daily  nutritional  requirement.  The  test  cage  was  located  within  a 
sound-attenuating  chamber.  Behavior  was  monitored  continuously  with  a 
video  camera. 

Training  procedure 

Initially,  a  hungry  cat  was  placed  in  the  test  cage,  allowed  to  explore 
its  new  environment,  and  received  food  rewards  under  the  investigator's 
control  on  a  small  food  reward  plate.  In  the  next  stage,  the  cat  was  trained 
to  press  a  nose  key  located  at  mid-line  to  indicate  an  observing  ("ready") 
response,  for  which  it  received  food.  The  observing  response  not  only 
ensured  that  the  cat  was  attending  and  could  initiate  trials  at  its  own  pace 
and  volition,  but  also  forced  the  cat's  head  position  to  be  constant  from 
trial  to  trial.  A  center  key  press  automatically  triggered  a  food  reward. 
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Aiternativeiy,  if  the  cat  required  "shaping",  the  investigator  could  produce 
a  food  reward  as  the  cat  progressively  came  closer  and  closer  to  pressing 
the  center  key  on  its  own.  For  instance,  if  the  cat  oriented  its  head  toward 
the  center  key,  or  moved  toward  the  center  key,  or  lightly  pressed  the 
center  key,  the  investigator  would  deliver  a  food  reward,  requiring  over 
trials  behavior  that  came  closer  and  closer  to  the  desired  autonomous 
behavior. 

Once  the  cat  was  reliably  pressing  the  center  key  in  order  to  receive 
the  food  reward,  auditory  stimuli  were  introduced;  reward  then  required  a 
center  key  press,  which  was  followed  by  auditory  stimulus  presentation,  to 
which  the  cat  was  required  to  make  a  response  either  to  the  left  or  to  the 
right  side.  Responses  were  indicated  on  two  spatially  separated  nose-press 
keys,  one  to  each  side  of  the  middle  observing  key.  (See  Figure  1.)  To 
facilitate  this  stage  of  learning,  the  investigator  usually  remained  present  in 
the  sound  booth  and  baited  the  side  keys  with  a  small  amount  of  food,  by 
inserting  a  small  narrow  plastic  tube  with  food  on  the  end  into  the  cage  and 
onto  the  response  key,  which  the  cat  readily  followed.  Eventually,  by 
indicating  the  correct  response  with  only  a  touch  to  or  movement  toward 
the  correct  key,  the  investigator  could  help  the  cat  acquire  this  behavior. 
Shaping,  by  giving  reward  as  the  animal  approached  the  correct  response, 
on  the  control  panel  outside  the  booth,  was  also  later  used  to  assist  the 
learning  of  this  stage  by  the  animal. 

Initially,  stimuli  were  presented  from  two  small  lateralized  speakers, 
which  were  located  at  approximately  60  to  80  degrees  to  the  left  and  right 
of  midline.  At  the  beginning  of  training  the  stimulus  on  the  left  side  was 
either  two  or  three  stimuli,  T  the  same  phase  (always  0),  (designated  AAA 
or  AA);  the  stimulus  on  the  right  was  maximally  different,  i.e.  180  degrees 
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different  in  phase  from  the  left  stimulus  (designated  BBB  or  BB).  Over 
time  the  stimuli  were  changed  to  be  AA  on  the  left,  and  AB  on  the  right. 
The  requirement  that  the  animal  go  left  for  reward  to  a  left-side  stimulus 
presentation  and  go  right  to  a  right-side  stimulus  presentation  was  usually 
fairly  easily  learned  by  the  cats  and  took  approximately  2  to  3  weeks  of 
training. 

As  the  animal  learned  to  go  left  or  go  right  based  upon  the  stimulus 
presented,  the  speakers  were  moved  in  to  the  center  location  gradually  until 
finally  the  ripple  stimuli  were  presented  from  a  single  speaker  overhead. 
Stimulus  pair  AA  (i.e.  two  identical  stimuli)  required  Response  A  (left-side 
key  press)  in  order  to  receive  reward;  Stimulus  pair  AB  (i.e.  second 
stimulus  different  from  the  first)  required  Response  B  (right-side  key 
press)  in  order  to  receive  reward.  The  switch  from  discrimination  based 
upon  side  of  stimulus  presentation  to  discrimination  based  upon  the  quality 
of  the  stimuli,  that  is,  two  stimuli  the  same  vs.  a  second  stimulus  different 
from  the  first,  was  quite  difficult  for  the  cats  to  acquire.  Some  animals 
were  unable  to  learn  this  task,  and  their  training  was  discontinued.  The 
three  cats  which  were  able  to  learn  this  discrimination  varied  in  length  of 
time  required  to  learn  the  procedure  from  a  minimum  of  3  weeks  to  a 
maximum  of  two  months.  Performance,  i.e.  percent  correct  overall,  and 
percent  cf  rrect  on  the  left  and  percent  correct  on  the  right,  was  calculated 
on-line,  as  for  human  subjects.  Unlike  the  human  testing  however,  there 
was  no  pre-set  number  of  trials  per  session;  cats  continued  working  until 
they  were  sated  and  did  not  initiate  any  further  trials.  They  usually 
performed  anywhere  from  100  to  250  trials  per  day.  At  the  end  of  the  test 
session,  the  stimulus  phase  and  response  values  were  stored.  Thresholds 
were  computed  later,  in  the  same  manner  as  that  used  for  the  human 
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subjects,  i.e.  by  a  linear  regression  formula  applied  to  the  psychometric 
function  which  calculated  the  phase  shift  correctly  detected  on  more  than 
75%  of  the  trial  presentations. 

Stimulus 

Stimuli  were  ripple  spectra  centered  at  3.675  kHz,  with  a  ripple 
density  of  1.00  ripple/octave,  a  fundamental  frequency  of  69  Hz  and  a 
ripple  bandwidtn  of  3  octaves.  The  stimulus  extended  from  910  Hz  to 
7,153.2  Hz.  The  center  frequerxy  of  3.675  kHz  was  not  so  low  as  to 
preclude  the  possibility  of  recording,  since  frequencies  lower  than 
approximately  500  Hz  to  1  kHz  are  usually  located  within  the  posterior 
ectosylvian  sulcuf,  and  difficult  to  record  from.  Also,  cat  vocalizations 
appear  to  contain  pea’rs  of  spectral  energy,  the  largest  being  usually  a 
resonance  of  the  vocal  tract  and  located  between  about  1  and  2  kHz,  with 
spectral  peaks  extending  up  to  approximately  4  kHz  (Shipley,  Carterette,  & 
Buchwald,  1990).  Thus,  the  center  frequency  of  3,675  kHz  is  a  frequency 
likely  to  be  of  interest  to  cats,  and  possible  to  record  from  on  the  cortical 
surface  (see  also  Heffner  &  Heffner-  the  hearing  range  of  the  cat,  1985). 

The  phase  of  the  envelope  of  the  stimulus  was  shifted,  originally  by  180 
degrees,  in  a  positive  direction  only  (a  downward  shift  in  the  frequency  of 
the  center  peak);  the  range  of  values  was  narrowed  and  lowered  gradually 
as  the  animal's  performance  improved.  The  ripple  density  of  the  ripple 
stimuli  was  constant  at  one  lipple/ociavc;  thus,  the  distance  between  the 
ripple  peaks  remained  the  same,  however  the  frequency  locations  of  the 
peaks  changed  with  the  shift  in  ripple  phase.  E)epth  of  modulation  of  the 
envelope  was  30  dB.  The  intensity  of  the  stimuli  varied  randomly  by  +/-  3 
dB  around  approximately  64  dB  SPL,  Figure  2  illustrates  the  ripple 
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stimulus  used  for  animal  testing.  At  the  end  of  testing,  the  cats  underwent 
electrophysiologica!  recording. 
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Figure  1.  Configuration  of  Animal  Response  Keys  and  Reward 
Plate.  If  the  two  stimuli  were  the  same,  the  cat  pressed  the  left 
response  key  to  receive  reward;  if  the  second  stimulus  was 
different,  the  cat  pressed  the  right  response  key  to  receive 
reward. 
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Figure  2,  Ripple  Stimulus  for  Animal  Testing,  showing  0 
degree,  180  degree,  and  90  degree  phase  shift.  Corresponding 
frequency  locations  are  shown:  3.675,  2.6,  and  3.1  kHz. 
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and  negative  180  degrees,  at  locations  of  4.3  and  5.1  kHz* 


RESULTS 

The  first  behavioral  cat  (^218)  began  training  at  age  ten  months.  It 
required  almost  two  months  to  shape  the  cat  to  reliably  perform  detection 
of  ripple  envelope  phase  shifts.  The  cat  was  subsequently  tested  over  a 
period  of  five  months  before  it  underwent  elcctrophysiological  recording. 
Threshold  was  computed  after  each  daily  session  as  the  smallest  degree 
positive  shift  in  phase  of  the  envelope  of  the  stimulus  that  the  cat  could 
detect  with  75%  or  greater  accuracy.  An  example  of  a  psychophysical 
function  for  this  cat  is  shown  in  Figure  3.  Its  threshold  in  this  particular 
test  session  was  approximately  57  degrees. 

Figure  4a  shows  the  performance  of  the  animal  over  the  lime  course 
of  the  test  sessions.  The  average  threshold  over  the  first  ten  test  sessions 
was  about  127  degrees.  The  final  threshold  for  this  animal  (its  average 
over  the  last  ten  test  sessions)  was  a  phase  shift  of  80  degrees. 

The  second  trained  cat  (Jf29'i)  began  training  at  age  eight  months;  it 
required  only  3  weeks  to  train  to  the  procedure.  It  was  tested  over  a  six- 
month  period,  and  then  underwent  elcctrophysiological  recording.  Figure 
4b  shows  the  improvement  in  threshold  over  time  for  this  cat.  The  average 
threshold  over  the  first  ten  test  sessions  for  this  cat  was  76  degrees;  its  final 
threshold  (the  average  of  the  last  ten  sessions)  was  18  degrees. 

The  third  trained  cat  (#176)  began  testing  at  age  six  months  and 
required  two  months  to  train  to  the  procedure.  It  was  tested  over  a  four 
month  period,  then  underwent  elcctrophysiological  recording.  Figure  4c 
shows  the  change  over  time  of  the  threshold  for  this  animal.  The  average 
threshold  over  the  first  ten  test  sessions  for  this  cat  was  66  degrees.  Its 
final  thr  ;shold  (again,  the  average  of  the  final  ten  test  sessions)  was  33 
degrees. 


A  paired  two-group  one-tail  t-test  comparing  the  average  threshold 
at  the  beginning  of  testing  (averaged  over  ten  consecutive  testing  days)  to 
the  final  threshold  for  all  three  cats  was  performed,  with  the  test  hypothesis 
being  that  the  latter  threshold  was  smaller.  The  difference  was  significant 
at  p=0.01. 

A  comparison  of  the  three  cats'  peifcrmance  over  time  is  shown  in 
Figure  4<i.  Note  that  the  thresholds  for  the  first  trained  cat  (#218)  were 
higher  overall  than  the  thresholds  for  the  other  two  cats. 

Two  other  cats  (#122  and  40764)  were  unable  to  learn  the 
discrimination  procedure;  they  were  able  to  respond  to  the  stimulus  on  ihc 
basis  of  location  of  the  sound,  i.e.  they  were  able  to  respond  when  two 
lateralized  speakers  were  used.  However,  they  were  unable  to  make  the 
switch  to  discriminating  the  stimuli  coming  from  one  overhead  speaker, 
based  on  the  quality  of  the  sounds.  It  is  interesting  to  note  that  these  two 
cats,  in  contrast  to  the  three  successfully  trained  animals,  were  generally 
nervous,  timid,  and  slow-moving.  They  also  underwent 
clcctiophysiological  recording,  and  were  considered  "exposed"  control 
animals;  i.e.  they  had  been  exposed  to  the  stimuli,  but  had  not  performed 
o|)erant  discriminations  based  upon  a  perception  of  differences  in  these 
stimuli. 

Three  out  of  five  cats  were  able  to  leara  a  two-altemative  forced 
choice  auditory  discrimination  procedure.  The  time  required  to  train  to 
the  procedure  varied  from  3  weeks  to  2  months.  The  threshold  was  lower 
for  all  three  cats  at  the  end  of  testing,  as  compared  to  the  threshold  earlier 
in  training.  Performance  improvements  were  continuous  and  progressive. 


Percent  Correct 


Figure  3.  Typical  psychophysical  function  for  cat  v'V218. 
Threshold  envelope  phase  shift  (75%  correct)  is  approximately 
57  degrees. 


threshold  phase  shift  (in  degrees) 
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Figure  4  a)  Performance  over  time  of  Cat  #218,  b) 
Performance  over  time  of  Cat  #293. 


c)  cat  91 /» 


Figure  4  c)  Performance  over  time  of  Cat  #176,  d)  Comparison 
of  thresholds  over  time  of  three  trained  cats. 


DISCUSSION  &  CONCLUSIONS 


The  results  of  this  research  are  related  to  various  issues,  including: 
a)  language  acquisition  and  language  learning,  and  b)  what  general 
mammalian  auditory  processing  capacities  have  to  tell  us  about  the 
evolution  of  linguistic  capabilities. 

[.anguage  acquisition  and  language  learning 

Infant  studies  have  shown  that  the  young  infant  has  a  very 
sophisticated  ability  to  perceive  the  basic  sounds  of  language  (Eimas  & 
Tartter,  1979).  Infants  even  below  the  age  of  4  months  arc  able  to  make 
discriminations  among  steady-state  vowels.  Trchub  (1973)  showed  that  [a] 
could  be  distinguished  from  [i]  whether  in  isolation  or  in  the  context  of  a 
preceding  stop  consonant.  Infants  could  also  distinguish  [i]  from  [u]. 
Swoboda,  Morse,  &  Leavitt  (1976)  showed  that  the  vowel  contrast  [i]  vs  [I], 
as  in  "beet  "vs  "bit"  was  discriminable  by  infants.  Trchub,  Endman,  & 
Thorpe  (1990)  presented  complex  stimuli  that  differed  in  spectral  structure 
to  7  to  8.5  month-old  infants.  Infants  successfully  differentiated  two 
spectral  structures,  even  when  they  varied  in  fundamental  frequency, 
intensity,  or  duration.  It  was  concluded  that  infants  classify  tonal  stimuli 
on  the  basis  of  cimbre.  Similarly,  Clarkson,  Clifton,  &  Perris  (1988) 
presented  sounds  that  differed  in  their  spectral  envelopes  to  7-month-old 
infants,  who  successfully  discriminated  the  sounds.  They  point  out  that 
vowels  have  multidimensional  qualities  which  distinguish  them 
individually;  they  argue  that  timbre  is  a  multidimensional  attribute,  and 
point  to  evidence  that  shows  that  "adults'  perception  of  tonal  complexes 
reveals  that  the  spectral  envelope  is  the  most  important  determinant  of 
timbre  (Plomp,  1970)"  (p.20). 
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Thus,  the  ability  to  process  vowel  differences  based  upon  changes  in 
spectral  envelope  seems  to  be  an  innate  ability  in  humans.  From  the  animal 
behavioral  data  presented,  discriminability  of  changes  in  spectral  envelopes 
is  also  present  in  cats.  It  may  be  a  basic  mamnmlian  auditory  processing 
capacity. 

Even  though  humans  seem  to  have  innate  speech-processing 
capabilities,  experience  does  seem  to  have  an  effect.  Streeter  (1976) 
investigated  the  discrimination  of  voicing  onset  by  infants  reared  in  a 
linguistic  environment  in  which  the  English  voiced/voiceless  distinction  was 
not  used.  The  results  suggested  that  "previous  exposure  to  the 
prevoiced/voiced  distinction  in  the  language  or  perhaps  exposure  to  the 
acoustic  cues  underlying  this  distinction  is  a  prerequisite  for  discrimination 
of  the  contrast.  On  the  other  hand,  the  voiced/voicclcss  discrimination  can 


be  made  in  the  absence  of  relevant  linguistic  exposure,  since  Kikuyu  infants 
do  not  hear  this  voicing  distinction  used  in  their  language.  Thus,  the 
results  suggest  that  there  is  an  interaction  between  nature  and  nurture; 
sonie  phonetic  or  acoustic  discriminations  may  be  unive.^al,  whereas  others 
seem  to  require  previous  relevant  exposure"  (p.40). 

Kuhl,  Williams,  Lacerda,  Stevens,  &  Lindblom  (1992)  observed  that 
exposure  to  a  specific  language  in  the  first  six  months  of  life  alters  infants' 
phonetic  perception.  Infants  in  America  and  Sweden  were  tested  with  both 
native-  and  foreign-language  vowel  sounds.  "Infants  from  both  countries 


pointed  out,  this  perceptual  organizing  occurs  well  before  the  age  at  which 


children  begin  to  acquire  word  meanings,  i.c.  well  before  the  acquisition  of 


language.  It  was  suggested  that  "linguistic  experience  shrinks  the 


perceptual  distance  around  a  native-language  prototype,  in  relation  to  a 


nonprototype,  causing  the  prototype  to  pcrcepiiially  assimilate  similar 
sounds"  (p.608).  Thus  phonetic  prototypes  were  conjectured  to  be 
"fundamental  perceptual-cognitive  building  blocks"  (p.608). 

The  data  in  this  study  support  the  idea  that  certain  basic  auditory 
processing  capabilities  are  present  in  cats,  and  that  exposure  to  or 
experience  with  the  stimuli  refines  the  processing  capability  and  the 
cortical  representation  of  that  stimulus.  In  the  same  way  that  a  perceptual 
organization  was  set  up  in  the  brains  of  the  six-month-old  infants,  so 
categorization  and  learning  occur  during  the  acquisition  of  a  first  or  second 
language.  Learning  occurred  in  these  animals  as  they  performed  the 
auditory  discrimination,  and  their  brain  representations  of  these 
behaviorally  important  stimuli  were  changed  in  parallel.  It  seems  plausible 
that,  at  birth,  certain  basic  auditory  perceptual  capabilities  are  present,  and 
that  with  linguistic  exposure,  certain  contrasts  are  sharpened  and  certain 
ether  unused  phonetic  contrasts  drop  out  of  the  discriminative  repertoire. 
The  cortical  organization  thus  dynamically  represents  those  stimuli  to 
which  the  human/animai  has  been  exposed  and  which  are  informationally 
and  biologically  relevant  to  that  creature. 

Evolution  of  linguistic  abilities 

Clearly  humans  are  more  "intelligent"  than  cats.  (If  in  doubt,  try 
training  a  cat  on  an  operant  discrimination  procedure.)  Cats  were  able  to 
learn  the  paradigm,  however,  and  showed  good  resolution  abilities  (at  least 
two  out  of  three  cats  did),  but  there  was  variability  in  performance,  and 
two  other  cats  were  unable  to  learn  the  discrimination  pvocedure. 
Thresholds  for  the  two  adept  cats  were  about  3-4  times  higher  than 
humans  for  the  equivalent  ripple  density  (Keeling,  Schreiner,  &  Jenkins, 
1992).  Their  intelligence  as  well  as  their  perceptual  discrimination  abilities 


for  delecting  envelope  phase  shifts  are  at  a  lower  level  but  are  on  the  same 
continuum  as  humans. 

Watson  (1991,  recently  reported  significant  relationships 
between  simple  sensory,  cognitive,  and  motor  abilities  and  psychometric 
intelligence.  She  reminds  us  that  Spearman  (1904)  and  others  reported 
relationships  between  pitch  discrimination  and  intelligence.  The  suggestion 
is  that  working  memory  is  a  limited-capacity  system,  and  that  therefore 
individuals  who  can  process  information  rapidly  will  have  more  working 
memory  available  to  process  new  information.  Thus,  the  sensory  ability  to 
perceive  differences  and  make  discriminations  can  be  considered  part  of  a 
processing  capacity.  Learning  calls  upon  this  information-processing 
ability. 

The  innate  abilities  with  which  humans  arc  born  interact  with 
experiential  input,  especially  during  development  but  later  as  well,  to 
produce  a  cerebral  organization  which  reflects  that  interaction.  The  huge 
amount  of  neural  modification  that  occurs  within  the  brain  during 
development  allows  us  to  “make  sense  of  the  world”.  As  Edelman  (1987) 
says:  “it  is  difficult  to  imagine  the  world  as  it  is  presented  to  a  newborn 
organism.. .the  environment  presented  [to  such  an  organism]  is  inherently 
ambiguous;  even  to  animals  eventually  capable  of  speech  such  as  ourselves, 
the  world  is  initially  an  unlabeled  place”  (p.3).  Adaptive  behavior  requires 
initial  categorization  of  salient  aspects  of  the  environment  so  that  learning 
can  occur  (p.4).  Thus,  perceptual  abilities  are  very  important  in  order  to 
form  concepts.  All  animals  are  bom  v/ith  certain  basic  sensory  and 
discriminative  capacities;  all  must  interact  successfully  with  their 
environment  in  order  to  survive.  This  capacity  for  learning  and  for 
change  has  allowed  Homo  sapiens  to  evolve.  In  turn.  Homo  sapiens  has 
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developed  speech  and  language,  culture  and  community.  Our  capacity  for 
speech  has  advanced  our  development  of  thought,  which  has  changed  the 
world.  These  faculties  have  evolved  over  many  eons  of  time  due  to  the 
mechanism  of  neural  adaptation.  How  this  cortical  change  occurs  is 
unknown.  Evidence  that  it  occurs  is  amazing  in  itself. 
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Abstract 

The  perception  of  phase  shifts  in  the  spectral  envelope  of  vowel-like  ripple 

stimuli  was  investigated  by  measuring  the  ability  of  humans  to  detect  frequency 

shifts  in  fonnant/ripplc  peak  location.  Tlie  standard  ripple  stimulus  was  a 

harmonic  complex  with  a  fundamental  frequency  of  approximately  45  Mz  and  a 

bandwiddi  of  three  octaves  centered  at  1.38  kHz;  the  ripple  phase  of  the 

standard  stimulus  was  0  degrees;  the  ripple  depth  was  20  ITic  component 

amplitudes  of  the  unmodulated  signal  decreased  by  6  dB/octavc.  Six  humans 

were  tested  in  a  modified  2-Altemative  Forced  Choice  paradigm  for  their 

ability  to  discriminate  envelope  phase  shifts  in  these  vowel-like  ripple  stimuli. 

Ripple  dcnsitie.s  from  0.5  ripples/octave  to  8  ripples/octave  were  tested. 

Difference  tliresholds  (75%  correct)  were  reported  in  degrees  of  envelope 

phase  shift  relative  to  the  standard.  The  smallest  difference  detectable  in  ripple 

phase  shifts  was  in  tlic  range  from  3  to  30  degrees,  llie  highest  thresholds  were 

found  for  ripple  densities  between  3.5  and  5  ripples/octave.  Tliresholds  were 

lowest  for  ripple  densities  below  approximately  4  ripples/octavc  -  the  peak 

spacings  that  correspond  to  die  majority  of  fonnant  ratios  in  English  vowels. 
PACS  43.71 
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INTRODUCTION 

Vowels  are  characterized  by  the  positional  spicing  of  formants  - 
component  frequency  regions  which  are  elevated  in  amplitude  relative  to 
other  component  frequencies  and  which  are  produced  by  the  natural 
resonances  of  the  vocal  tract  invoked  during  production  of  the  vowel. 
Formant  values  show  considerable  variability  across  speakers  however,  and 
even  overlap  for  some  vowels  (Peterson  &  Barney,  1952).  Nevertheless, 
listeners  are  quite  proficient  at  identifying  vowels. 

How  vowels  are  perceived  has  been  the  subject  of  much  speculation 
and  examination.  Green  (1988)  and  colleagues  investigated  the 
discrimination  of  a  change  in  spectral  shape;  they  called  this  mechanism 
profile  analysis;  and  felt  that  a  critical  feature  in  the  detection  process  is  tlie 
simultaneous  comparison  of  the  intensity  level  at  different  frequency 
regions  of  the  spectrum.  Green  points  out  that  changes  in  tlie  relative  level 
of  different  parts  of  the  acoustic  spectrunv  are  perceived  by  listeners  as 
changes  in  "sound  quality".  Sound  quality  differences  exemplify  vowel 
discr’.mination. 

Miller  (1989)  reports  that  in  the  late  1800's  Richard  John  Lloyd 
formulated  the  fomiant-ratio  theory  of  the  vowel  stating  that  "vowel 
quality  depends  on  tlie  intervals  between  the  resonances,  not  tlieir  absolute 
values"  (p.2155).  Miller's  auditory-perceptual  interpretation  of  the  vowel 
is  a  descendant  of  the  formant-ratio  theory  and  proposes  that  changes  in 
spectral  patterns  of  formants  effectively  categorize  vowel  sounds. 

The  ratios  of  the  formants  may  be  an  important  cue.  We  calculated 
tlie  ratios  of  the  first  and  second,  and  second  and  third  formants;  since 
however,  frequency  is  represented  on  a  logarithmic  scale  along  the  basilar 
membrane  within  the  cochlea,  we  computed  the  ratios  of  tlie  formants  in 
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units  per  octave  (Miller,  1989).  llie  spacing  of  the  formants  or  ripple 
density  for  the  formant  ratio  F1/F2  and  F2/F3  of  10  English  vowels  is 
shown  in  Figure  1  with  values  grouped  for  man,  woman,  and  child.  Tiie 

average  frequency  values  for  the  formants  were  taken  from  .  (see 

The  Science  of  Sound).  Ripple  densities  extend  from  0.3  ripples/octave  to 
4.5  ripples/octave,  with  the  majority  being  less  than  3  ripples/octave.  Note 
that  the  ripple  densities  for  most  vowels  across  men,  women,  and  children 
were  quite  similar. 

We  hypothesized  that  the  perception  of  variations  in  the  the  position 
of  spectral  peaks  is  dependent  on  tlie  spacing  of  the  peaks.  We  set  out  to 
measure  humans'  ability  to  discriminate  among  vowel-like  complex  stimuli. 

We  used  a)  a  broad-band  stimulus  (3  octaves  wide)  in  order  to  simulute  a 
natural  vowel,  b)  a  harmonic  spectrum,  again  in  order  to  mimic  vowels,  c) 
a  sinusoidal  envelope  pattern  on  a  logarithmic  scale,  and  we  chose  d)  20  dB 
as  the  depth  of  modulation  of  the  standard  stimuli.  The  precision  with 
which  the  peak  shifts  could  be  detennined,  and  the  variation  of  this 
precision  across  several  ripple  densities  was  expected  to  shed  light  upon  the 
importance  of  small  changes  in  formant  location  for  vowel  processing,  and 
thus  elucidate  the  perception  of  vowels. 

METHODS 

Stimulus  generation  and  presentation  and  response  tabulation  were 
controlled  by  a  Macintosh  Ilx  computer.  Tne  LabVIEW  application 
(National  Instruments  Corporation)  was  used  to  automate  procedures  for 
data  acquisition  and  instrument  control.  A  program  was  created  using  a 
DSP  board  and  output  via  a  1<:  bit  Audio  board,  that  allowed  the 
presentation  of  ripple  stimuli  to  the  subject  in  the  sound  booth,  and  to 
record  and  tabulate  on-line  the  responses  of  tlie  subject.  An  input/output 
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board  served  as  the  interface  between  the  computer  and  the  subject.  A  H 

I 

Crown  D-75  amplifier  amplified  the  signals.  |H 

I 

Subjects  and  Procedure  H 

1 

Six  persons  (two  men  and  four  women;  mean  age:  35  years,  range  22  H| 

1 

to  44  years)  with  normal  hearing  in  the  tested  frequency  range  as  H 

determined  by  professional  audiological  assessment,  were  tested  using  a 

1 

modified  2-Altemative  Forced  Choice  (2-AFC)  procedure.  ITie  subject 

1 

was  seated  in  a  sound-attenuating  chamber  (lAC).  The  subject  was  told  tliat  H 

upon  initiating  a  trial  by  a  center  button  press  on  a  panel  held  on  tlie  lap, 

1 

s/he  would  hear  through  headphones  (AKG  -K240DF)  three  sounds.  The  H 

stimulus  presentation  was  monaural,  to  the  left  ear.  The  task  of  the  subject  H 

mHH 

1 

was  to  indicate  which  of  the  three  sounds  was  different  from  the  other  two.  H 

The  first  stimulus  served  as  a  standard  stimulus  and  was  always  the  same  H| 

for  a  single  test  session.  Either  the  second  or  the  third  stimulus  was  M 

i 

different  from  the  other  two;  the  subject  was  to  indicate  tlie  odd  stimulus  H 

by  a  button  press:  left  button  if  the  second  stimulus  was  different  from  the  H 

other  two,  right  button  if  the  third  stimulus  was  different.  A  green  light  M 

:l., 

tJ^M 

was  illuminated  if  the  response  was  correct.  In  each  test  session  there  were  H 

9 

147  trials  (21  different  triads,  each  presented  7  times).  A  session  usually  H 

■H 

fl 

took  about  ten  minutes.  Subjects  usually  completed  three  or  four  sessions  |H 

per  sitting.  H 

H 

Stimulus  S 

B 

The  formant  values  for  ten  English  vowels  (values  takes  from  H 

■ 

Peterson  &  Barney,  1952)  were  converted  into  ratios  for  the  first  formant  H 

B 

and  second  formant  (F1/F2)  and  for  the  second  formant  and  third  formant  H 

BAH 

^Bl 

B 

(F2/F3).  These  values  were  then  converted  to  a  logarithmic  scale,  giving  H 

!$■ 

i^S 

E^fl^l 

the  number  of  peaks  or  ripples  per  octave,  also  termed  tlie  ripple  density.  H 

_  _  __  __ . 
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Figure  1  shows  the  ripple  densities  for  F1/F2  and  for  F2/F3.  Values  range 
from  0.3  to  4.5  ripples  per  octave,  with  the  majority  below  3 
ripples/octave. 

In  order  to  test  the  discriminability  of  envelope  phase  shifts  as  a 
function  of  formant  spacings,  a  ripple  stimulus  was  created  with  the 
following  characteristics:  a)  the  earner  consisted  of  a  harmonic  scries  with 
a  fundamental  frequency  of  approximately  45  Hz  (to  give  closely  spaced 
harmonics)  and  a  6  dB/octave  decline  of  the  component  amplitudes;  b)  tlte 
bandwidth  of  the  stimulus  was  3  octaves;  c)  the  spectral  envelope  of  the 
signal  was  represented  by  a  sinusoid  on  a  logarithmically  scaled  frequency 
axis,  tlie  frequency  of  the  envelope  sinusoid  was  referred  to  as  ripple 
density  and  measured  as  rippies/octave,  varied  from  0.5  to  8  ripples/octave; 
and  d)  the  modulation  depth  of  the  envelope  (lipple  depth)  was  linear  on  a 
dB  scale,  and  was  set  at  20  dB. 

The  standard  ripple  stimulus  (Figure  2)  always  contained  a  central 
formant  that  was  centered  at  1.38  kHz.  This  frequency  is  approximately 
centered  within  the  range  of  formant  frequencies  for  English  vowels:  (270 
Hz  to  3010  Hz;  Peterson  &  Barney,  1952).  Ihe  stimulus,  being  three 
octaves  wide,  extended  from  340  Hz  to  2,672  Hz.  Within  each  trial,  it  was 
deemed  necessary  tliat  the  total  energy  content  of  each  stimulus  be  kept 
constant;  this  was  done  by  varying  the  phase  of  the  envelope  of  the 
siimulus,  instead  of  the  ripple  density. 

The  ripple  phase  of  the  standard  stimulus  was  0  degrees,  i.e.  the 
signal  (or  odd)  ripple  stimulus  had  a  phase  varying  from  1  degree  to  180 
degrees.  — Figure  3  illustrates  a  ripple  stimulus,  with  ripple  density  of  2, 
shifted  45  degrees.  As  can  be  seen,  the  frequency  values  of  the  peaks  are 
shifted.  ITie  phase  was  shifted  in  botli  a  positive  and  a  negative  direction 
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relative  to  the  standard  stimulus;  for  the  positive  pliase  shift,  frequency 
peaks  shifted  to  lower  frequencies,  for  the  negative  phase  shift,  frequency 
peaks  shifted  to  higher  frequencies. 

The  intensity  of  tlie  stimulus  was  60  dB  SPL.  All  three  stimuli 
within  each  trial  were  presented  at  the  same  level,  as  it  was  found  to  be  too 
difficult  for  subjects  to  ignore  intensity  variations  of  more  than  2-3  dB 
while  discriminating  phase  shifts.  However,  the  intensities  of  the  stimuli 
varied  randomly  by  +/-  3  dB  around  the  standard  60  dB  SPL  stimulus 
across  trials. 
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Threshold  computation 

The  minimum  phase  shift  of  the  envelope  that  could  be  detected  with 
this  modified  2-AFC  method  was  computed  in  the  following  way.  In  each 
testing  session,  one-hundred  forty-seven  trials  were  presented.  In  each 
trial,  either  the  second  or  third  stimulus  was  different  from  the  other  two. 
The  subject  was  forced  to  make  a  choice  and  was  thus  either  correct  or 
incorrect  on  each  trial.  At  the  beginning  of  the  session,  the  range  of  phase 
shift  values  was  set,  with  1  to  180  degrees  being  the  largest  range.  The 
ideal  was  to  present  a  range  of  values  that  challenged  the  subject  in  order  to 
define  the  true  tlireshold,  but  that  was  not  so  difficult  as  to  produce 
extreme  fmstration  — for  the  subject  (Green,  1990).  Twenty-one  values 
within  this  range  were  then  randomly  presented  as  the  "different  stimulus", 
each  being  presented  7  times  (thus  equaling  a  total  of  147  trials).  'Ihe 
number  of  responses  con'ect  at  each  phase  shift  value  were  tabulated  on¬ 
line  by  computer.  On  screen  could  be  seen  the  overall  percent  of  responses 
correct.  In  addition,  the  percent  correct  on  the  left  side  (left  button, 
indicating  second  stimulus  different)  and  the  percent  correct  on  the  right 
side  (right  button,  indicating  third  stimulus  different)  were  shown,  in  order 
to  detect  response  bias  for  either  stimulus.  At  the  end  of  the  test  session, 
stimulus  and  response  values  were  stored,  and  the  thresholds  computed.  A 
not  discriminated  stimulus  resulted  in  a  correct  response  percentage  of 
50%.  Discrimination  thresholds  were  considered  as  75%  correct,  and  were 
calculated  by  a  linear  regression  formula  applied  to  llie  psychometric 
function.  The  threshold  was  thus  the  minimum  phase  shift,  in  degrees, 
detectable  at  75%  correct.  Ripple  densities  from  0.5  ripples/octave  to  8 
ripples/octave  were  tested.  Each  ripple  density  was  te.sted  at  least  twice, 
totalling  294  presentations.  A  threshold  was  obtained  for  each  subject  at 
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each  ripple  density  value.  Further  conditions  were  presented  as  described 
below. 

a.  Ramped  vs.  Unramped  Envelope  Condition 

In  order  to  evaluate  whether  the  subjects  used  spectral  envelope 
information  across  the  entire  width  of  the  spectrum  or  were  biased  by 
changes  at  the  edges  of  the  spectrum,  a  ripple  stimulus  was  created  in 
which  the  spectral  envelope  was  ramped  at  the  low-  and  high-frequency 
end  of  the  signal.  In  the  ramped  portions,  the  depth  of  modulation 
changed  gr'^dually  over  0.5  octaves  from  a  modulation  depth  of  0  dB  at  the 
high  and  low-frequency  ends  of  the  spectrum  to  the  full  value  of  20  dB. 
Since  the  total  signal  width  was  3  octaves,  the  fully  modulated  envelope  was 
realized  over  2  octaves. 

Two  subjects  of  the  originally  described  six  subjects  were  tested  on 
the  ramped  ripple  stimulus  at  7  different  ripple  densities,  ranging  from  0.5 
ripples/octave  to  6  ripples/octave. 

b.  Depth  of  Modulation 

The  ripple  stimulus  had  been  created  with  the  possibility  of  varying 
several  of  its  parameters.  One  such  parameter  which  was  varied  for  one 
set  of  tests  was  depth  of  modulation  of  the  envelope  of  the  ripple  stimulus. 
It  was  varied  across  the  values  2.5,  5,  7,  10,  20,  and  30  dB,  for  each  of  the 
ripple  densities  of  0.5,  1,  2,  and  4  ripples/octave.  Two  subjects  were 
tested. 

c.  Intensity 

The  intensity  of  the  stimulus  had  been  60  dB  SPL  throughout  all 
previous  testing.  For  this  set  of  tests,  the  overall  intensity  levels  were 
varied  across  30,  40,  50,  60,  and  70  dB  SPL.  Thresholds  were  obtained  at 
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3  different  ripple  densities  (  one,  three  and  five  ripples/octave)  for  three 
subjects. 

RESULTS 


Thresholds  for  the  just  detectable  shift  in  envelope  phase  were 
obtained  for  each  subject  at  various  ripple  densities.  The  threshold 
represents  the  smallest  envelope  phase  shift  in  degrees  that  the  subject  can 
detect  witli  75%  or  higher  accuracy.  Figure  4a  shows  a  typical 
psychometric  function  for  a  representative  subject.  Performance  (percent 
correct)  is  plotted  as  a  function  of  degrees  shifted  negative  and  positive,  in 
this  case  for  a  ripple  density  of  1  ripple/octave.  Figure  4b  shows  the 
performance  of  the  same  subject  for  a  ripple  density  of  4  ripples/octave. 

In  each  test  session,  each  value  was  presented  7  times.  Each  ripple  density 
session  was  presented  at  least  twice.  Values  shown  represent  the  average 
performance  over  14  stimulus  presentations.  The  thresholds  for  a  ripple 
density  of  1  ripple/octave  were  +2.3  for  positive  phase  shifts  and  -4.7 
degrees  for  negative  phase  shifts.  The  thresholds  for  a  ripple  density  of  4 
ripples/octave  were  +13.5  and  -22.5  degrees,  respectively. 

Figures  5  and  6  illustrate  the  performances  for  each  of  the  six 
subjects,  presenting  thresholds  (minimum  phase  shift  detectable  at  75% 
correct)  in  firstly  positive,  then  negative  directions,  as  a  function  of  ripple 
density.  Note  that  the  overall  dependence  of  the  phase  threshold  is 


nonmonotonic,  and  that  for  positive  phase  shifts,  the  threshold  generally 


increased  with  an  increase  in  ripple  density  to  a  maximum  at  about  4  to  5 
ripples/octave,  then  generally  decreased.  For  negative  phase  shifts, 
similarly  nonmonotonic  behavior  was  seen  but  there  was  greater  variability 
in  the  shapes  of  the  functions,  and  thresholds  were  highest  at  anywhere 
from  3  to  8  ripples/octave. 
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Figure  7  presents  thresholds  across  all  tested  ripple  densities  pooled 
for  all  subjects:  a)  for  positive  phase  shifts  and  b)  for  negative  phase  shifts. 
For  positive  phase  shifts,  thresholds  were  highest  at  4.5  and  5 
ripples/octave.  The  lowest  threshold  was  at  1.5  ripples/octave,  where 
variability  among  subjects  was  lowest.  For  the  negative  phase  shifts,  the 
highest  thresh^.,  i  was  at  a  ripple  density  of  4  ripples/octave.  The  lowest 
threshold  was  again  at  1 .5  ripples/octave,  where  variability  among  subjects 
was  also  lowest.  Tables  I  and  II  present  the  means  and  standard  deviations 
for  each  of  the  averaged  values  across  all  ripple  densities,  for  envelopes 
shifted  in  both  positive  and  negative  directions.  A  repeated-measures 
ANOVA  was  carried  out  for  botn  positive  threshold  and  negative  threshold 
data.  For  the  positive  data,  the  overall  F  for  threshold  differences  between 
different  ripple  densities  was  significant  at  p=0.03.  Comparison  among 
individual  means  using  the  Scheffe  test  for  the  various  ripple  densities 
showed  differences  significant  at  p<0.05  for  the  following  ripple  density 
pairs:  .5  vs.  4,  4.5,  and  5;  1  vs.  4,  4.5,  and  5;  1.5  vs.  4,  4.5,  and  5;  2  vs. 
4.5  and  5;  2.5  vs.  4,  4.5,  and  5;  3.5  vs.  4.5,  and  5;  4.5  vs.  8;  and  5  vs.  8. 
For  the  negativ^' ’  Id  data,  the  overall  F  was  not  significant  (p=:0.07), 
although  a  si  .  ,al  trend  was  observed.  However,  in  comparing 

the  peaks  f ,  '  .su)  s  in  the  positive  vs.  the  negative  phase  shift 
condition  (two  group  paired  t-test),  there  was  a  statistically  significant 
(p=:0,01)  difference  between  the  location  of  the  peaks  in  the  two  conditions. 

Thus,  in  general,  what  was  observed  for  both  positive  and  negative 
envelope  phase  shifts  was  an  increase  in  threshold  as  ripple  density 
increased  up  to  approximately  4  to  5  ripples/octave,  then  a  decrease,  at 
least  to  tlie  extent  of  the  ripple  densities  tested  here,  i.e.  8  ripples/octave, 
a.  Ramped  vs.  Unramped  Condition 
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In  order  to  evaluate  whether  the  subjects  used  spectral  envelope 
information  across  the  entire  width  of  the  spectrum  or  were  biased  by 
changes  at  tlie  edges  of  the  spectmm,  a  ramped  ripple  stimulus  was  created. 
Tire  results  of  the  tests  on  the  ramped  ripple  stimulus  as  compared  to  the 
original  un-ramped  stimulus  are  shown  in  Figure  8.  The  positive  phase 
shift  tliresholds  are  plotted  as  a  function  of  different  ripple  densities  for  the 
ramped  and  un-ramped  conditions.  For  subject  (dk),  the  tliresholds  were 
somewhat  higher  in  tlie  ramped  condition  for  all  ripple  densities;  however, 
the  shapes  of  the  functions  were  very  similar,  with  tlie  highest  thresholds  at 
a  density  of  5  ripples/octave.  Comparison  of  the  two  conditions  (ramped 
vs.  unramped)  using  a  paired  t-test  revealed  tliat  they  were  different  for 
this  subject  (p=0.01).  For  subject  (be),  tlie  tliresholds  were  very  similar  in 
the  ramped  and  un-ramped  conditions.  Thresholds  weie  higher  in  the 
ramped  condition  for  5  of  8  ripple  densities.  Again,  tlie  shapes  of  the 
functions  were  very  similar,  tliis  time  witli  the  highest  thresholds  at  4 
ripples/octave.  For  this  subject,  when  the  two  conditions  were  compared 
using  a  paired  t-test,  they  did  not  significantly  differ  (p=.89).  Thus,  for 
each  subject  the  global  pattern  of  threshold  distribution  was  the  same. 
However,  since  there  were  differences  between  some  of  the  means  for  one 
of  the  subjects,  there  is  an  indication  that  different  subjects  may  use 
different  spectral  regions  in  their  discrimination  processing  of  envelope 
shifts. 

b.  Depth  of  Modulation 

'Fhe  depth  of  modulation  was  varied,  to  shed  light  on  tlie  contrast 
required  to  discriminate  spectral  energy  peaks.  The  results  from  these  test 
sessions  are  shown  in  Figure  9.  The  positive  phase  shift  thresholds  are 
plotted  as  a  function  of  the  depth  of  modulation,  for  four  different  ripple 
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densities.  For  both  subjects,  similar  trends  were  observed,  namely  a)  a 
rapid  change  of  threshold  with  modulation  depths  between  2.5  and  7.5  dB, 
and  b)  a  relatively  shallow  threshold  curve  for  modulation  depths  of  10,  20 
and  30  dB.  In  both  subjects,  thresholds  were  highest  at  a  depth  of 
modulation  value  of  2.5  dB  for  all  four  ripple  densities  tested,  lliresholds 
then  rapidly  decreased  at  depth  of  modulation  values  of  5,  and  7.5  dB.  For 
the  first  subject  (dk),  there  was  a  slight  increase  at  a  deptli-of-modulation 
value  of  10  dB  across  all  4  ripple  densities,  and  then  generally,  a  decline  in 
threshold.  ITie  absolute  lowest  threshold  for  this  subject  was  at  a  depth  of 
modulation  of  20  dB  witli  ripple  densities  of  1  and  0.5  ripples/octave.  For 
subject  be,  the  threshold  decreased  from  2.5  to  10  dB  deptli  of  modulation 
Ibr  all  ripple  densities  except  one  (ripple  density  =  4  ripples/octavc),  where 
it  was  increased,  llie  absolute  lowest  threshold  for  this  subject  was  at  a 
depth  of  modulation  of  30  dB  with  a  ripple  density  of  0.5  ripplcs/octave. 
For  both  subjects,  lowest  thresholds  were  at  either  20  or  30  dB  depth  of 
modulation  for  all  ripple  densities. 

It  appeared  from  the  results  of  Figure  9  tliat  the  tlTiresholds  at  the 
lowest  tliree  depths  of  modulation:  (2.5,  5,  and  7.5  dB)  grouped  separately 
from  tlie  thresholds  derived  for  the  second  tlirce  depths  of  modulation:  (10, 
20,  and  30  dB).  'ITierefore,  threshold  values  were  collapsed  across  ripple 
densities  and  divided  into  these  two  groups,  and  compared.  A  linear 
regression  analysis  was  perfonned.  ITie  values  for  the  two  subjects  were 
combined,  i.e.  at  each  point,  they  were  aaded  and  averaged.  The 
regression  for  the  first  group  of  modulation  depths  was  compared  to  the 
regression  for  the  second  group;  tlie  regression  coefficients  (slopes)  were 
significantly  different  at  p<0.01.  Figure  10  illustrates  the  comparison  of 
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the  regression  lines  for  the  two  groups  of  depth  of  modulation  for  the  two 
subjects'  combined  values. 

Thus,  it  appears  that  below  a  depth  of  modulation  value  of 
approximately  7.5  dB,  the  threshold  for  detecting  ripple  stimulus  envelope 
phase  shifts  increases  dramatically.  Above  the  depth  of  modulation  value 
of  7.5  to  10  dB,  the  threshold  varies  only  slightly  ,  at  least  for  the  depths  of 
modulation  presented  herein,  namely,  10,  20,  and  30  dB. 
c.  Intensity 

Vowels  are  spoken  and  differentiated  over  a  wide  range  of  levels. 
ITicrefore,  it  is  important  to  know  whether  the  ripple  phase  tliresholds 
measured  are  intensity  independent,  or  whether  they  vary  according  to  the 
intensity  at  which  tliese  stimuli  are  presented.  Results  are  shown  in  Figure 
1 1 .  The  positive  phase  shift  thresholds  arc  plotted  as  a  function  of  ripple 
densities  for  five  different  intensity  levels. 

For  subject  kk,  mean  and  standard  deviation  for  tlie  variability  due 
to  intensity  at  each  ripple  density,  are  shown  in  Table  III.  When  an 
ANOVA  was  computed  comparing  the  values  for  tlirce  ripple  densities 
across  five  intensities,  the  overall  F  was  significant  at  p=0.01,  but  there 
were  no  significant  differences  among  the  various  intensities  (Fisher  PLSD, 
Scheffe,  and  Duiuiett). 

For  subject  dk,  similar  results  were  found.  The  mean  and  standard 
deviation  for  tlie  variability  due  to  intensity  at  each  ripple  density  is  shown 
in  Table  IV.  When  an  ANOVA  was  computed  comparing  the  values  for 
three  ripple  densities  across  five  intensities,  the  overall  F  was  significant  at 
p<0.0!  (me  curve  was  not  flat),  but  there  were  no  significant  differences 
among  the  various  intensities  (for  all  post-hoc  tests:  Fisher  PLSD,  Scheffe, 
and  Dunnett). 
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For  subject  be,  results  were  again  similar.  The  mean  and  standard 
deviation  for  the  variability  due  to  intensity  at  each  ripple  density,  is  shown 
in  Table  V.  When  an  ANOVA  was  computed  comparing  the  values  for 
three  ripple  densities  across  five  intensities,  the  overall  F  was  not 
significant  (p-0.10);  there  were  no  significant  differences  among  the 
various  intensities.  Thus  in  general  threshold  does  not  appear  to  be 
strongly  dependent  on  intensity,  at  least  over  the  tested  range. 


Phase  shifts  compared  to  frequency  differences 

Wlien  tlie  spectral  envelope  of  the  ripple  stimulus  is  shifted,  tlie 
frequency  values  at  the  location  of  the  peaks  change.  Actually  the  amplitude 
at  the  various  component  frequencies  changes  so  that  different  peaks  occur 
but  the  iTcquencies  of  the  components  are  always  constant.  Ncveitlielcss,  it 
is  possible  to  calculate  the  frequency  difference  Uiat  would  correspond  to  a 
particular  ripple  stimulus  envelope  phase  shift  by  using  the  Ibnnula:  phase 
shift  =  (log  (b/a))/log  2)*360*  ripple  density,  where  'a'  is  the  center 
frequency  of  the  standard  stimulus,  in  this  case,  1 .38  kHz,  and  'h'  is  the 
center  frequency  of  the  comparison  stimulus.  We  compared  the  frequency 
dilference  equivalent  to  the  mean  threshold  envelope  i)hasc  shift  to 
frequency  difference  limens  previously  reported  for  pure  tones.  The 
frequency  difference  (in  Hz)  at  the  center  frequency  that  con'csponds  to  the 
average  threshold  foi  positive  envelope  phase  shifts  at  each  ripple  density  is 


listed  in  Table  VI.  Similar  frequency  differences  for  negative  envelope 
phase  shifts  are  listed  in  lablc  VII. 

Wier,  Jesteadt,  and  Green  (1977)  reported  frequency  difference 
limens  (DLs)  for  humans,  incoiporating  tJieir  own  and  previous  data  (see 
p.30).  Fay  (1988)  reports  tliat  die  frequency  discrimination  threshold  (or 
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frequency  difference  limen)  at  a  frequency  of  1000  Hz  is  1.9  Hz  and  ai 
2000  Hz  is  3.2  Hz  for  humans  (p.458).  Extrapolating  from  this 
information,  one  may  estimate  the  frequency  dl  at  1.38  kHz  to  be 
approximately  2.5  Hz.  From  Tables  VI  and  VII,  we  obseivc  that  the 
smallest  shift  detected  was  equivalent  to  a  frequency  shift  of  3.89  Hz  (for  a 
positive  shift  at  8  ripplcs/octave).  The  negative  shift  at  8  ripples/octave 
was  quite  similar  (3.96  Hz).  ITius,  the  smallest  discriminable  frequency 
difference  observed  is  larger  tlian  the  frequency  DL  value  reported  by  Fay, 
by  almost  a  factor  of  2.  Although  pure  tone  discrimination  and  ripple 
discrimination  are  clearly  not  tlie  same,  the  comparison  may  elucidate 
psychophysical  processing  mechanisms. 

DISCUSSION 

Humans  appear  to  be  most  adept  at  discriminating  ripple  envelope 
shifts  at  spacings  of  3.5  or  fewer  peaks/octave.  The  spacings  of  English 
vowels  are  mostly  of  3  or  less  pcaks/octavc.  Not  surprisingly,  perhaps, 
humans  are  best  at  discriminating  changes  within  tlie  spacings  of  English 
vowels.  Since  fonnants  are  detennined  by  natural  resonances  o*'  the  human 
vocal  tract,  die  implication  arises  that  die  human  auditory  system  has  either 
evolved  to  be  best  at  discriminating  those  parameters  diat  arc  important  in 
speech,  and/or  through  adaptive  learning,  has  developed  a  fonnal 
representation  of  diis  most  heavily  practiced  peak-spacing  domain.. 

Flanagan  (1955)  first  provided  detailed  measures  of  difference 
limens  (DLs)  for  vowel  fomiant  frequencies.  In  his  studies,  for  each 
syndiesized  vowel  three  out  of  four  of  the  formants  were  held  constant, 
while  either  FI  or  F2  was  varied.  In  the  first  set  of  tests,  F2  was  15(X)  Hz, 
F.3  measured  2500  Hz,  F4  was  3550  Hz,  and  FI  was  set  at  300,  500,  or  700 
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Hz.  Frequency  variations  used  in  determining  the  DLs  for  FI  were  -♦-/-lO, 
+/-20,  +/-30,  +/-40,  +/-50,  +/-60,  and  +/-70  Hz. 

In  a  second  set  of  tests,  F2  was  set  at  1000,  1500,  or  2000  Hz;  FI 
was  held  constant  at  500  Hz;  the  frequency  variations  used  in  determining 
the  DLs  for  F2  were  +/-25,  +/-50,  +A75,  +/-100,  +/-125,  +/-150,  and  +/- 
175  Hz.  Subjects  heard  two  stimuli  and  were  asked  to  report  whether  the 
second  sound  was  the  same  as  or  different  from  the  first.  The  50%  point 
of  stimuli  judged  different  was  taken  as  the  DL.  The  mean  just-discernible 
frequency  shift  was  approximately  3-5%  of  the  formant  frequency. 

Comparing  the  Flanagan  results  to  the  present  ripple  study  results  is 
complicated  by  the  fact  that  the  peak  or  formant  spacings  in  synthesized 
vowels  were  not  identical  among  all  peaks  within  a  vowel,  as  they  were  in 
our  ripple  study.  However,  ignoring  for  the  moment  the  formants  above 
F2,  tlie  various  spacings  between  FI  and  F2  studied  may  be  considered.  In 
the  first  set  of  tests  in  which  FI  was  varied,  the  frequency  values  of  FI  and 
F2  were  i)  300  Hz  and  1500  Hz;  ii)  500  Hz  and  1500  Hz;  and  iii)  700  Hz 
and  1500  Hz,  respectively.  In  the  second  set  of  tests,  in  which  F2  was 
varied,  the  frequency  values  of  FI  and  F2  were  i)  500  Hz  and  1000  Hz;  ii) 
500  Hz  and  1500  Hz;  and  iii)  500  Hz  and  2000  Hz.  The  equivalent  ripple 
densities  are  0.4,  0.6,  0.9,  1,  0.6,  and  0.5  ripples  or  peaks/octave. 

Flanagan  reported  that  human  subjects  could  detect  a  3  to  5%  shift  in 
spectral  peak  position.  That  would  be  equivalent  to  a  55  Hz  shift  at  the 
center  frequency  of  1.38  kHz  in  the  present  human  ripple  envelope  phase 
shift  study.  The  equivalent  envelope  phase  shift  for  a  ripple  density  of  1 
ripple/octave,  is  20.3  degrees.  For  a  ripple  density  of  0.5,  the  phase  shift 
value  equivalent  to  55  Hz  is  10.1  degrees.  As  will  be  recalled  from  Tables 
VI  &  VII,  human  phase  shift  thresholds,  at  a  ripple  density  of  1 
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ripple/octave,  were  7  degrees  for  positive  envelope  phase  shifts  and  8.6 
degrees  for  negative  envelope  phase  shifts,  averaging  7.8  degrees.  The 
human  phase  shift  thresholds  at  a  ripple  density  of  0.5  were  7.3  degrees  for 
positive  envelope  phase  shifts,  and  13.5  degrees  for  negative  envelope 
phase  shifts,  averaging  10.4  degrees.  The  equivalent  frequency  DLs  were 
21  Hz  and  54  Hz  for  ripple  densities  of  1  and  0.5  ripples/octave.  This 
information  is  presented  in  Table  Vni. 

Thus  the  frequency  DL  values  observed  by  Flanagan  of 
approximately  3-5%  of  the  frequency  tested  matched  the  measures  of  the 
detectable  shifts  for  0.5  ripples/octave  stimuli,  but  distributions  at  1 
ripple/octave  were  poorer  for  vowel  peak  shift  than  would  be  expected 
from  the  current  measures  of  ripple  peak  shifts  in  our  human  study. 

Van  Veen  &  Houtgast  (1985)  have  earlier  argued  that  the  depth  of 
modulation  plays  only  a  r:iinor  role  in  vowel  discrimination.  They  also 
observed  that  the  ripple  densities  that  are  most  important  for  the  perception 
of  differences  in  spectral  sharpness  are  around  2  ripples/octave,  hi  the 
current  study,  it  was  observed  that  across  four  ripple  densities  (0.5,  1,  2, 
and  4  ripples/octave),  depth  of  modulation  changes  affected  discrimination 
similarly,  with  depths  below  7.5  dB  producing  a  dramatic  increase  in 
threshold.  It  is  difficult  to  compare  these  results  directly  with  those  of  van 
Veen  &  Houtgast  because  they  u.sed  a  different  filtering  method  to 
spectrally  sharpen  their  stimuli.  However,  in  contradistinction  to  their 
general  conclusions,  the  present  study  reveals  that  for  deptlis  of  modulation 
below  about  7.5  dB,  shifts  in  frequency  peak  positions  rapidly  come  to  be 
far  more  difficult  to  detect. 

Shamma  &  Vranic  (1993)  reported  that  the  detection  of  change  in 
symmetry  in  spectral  shape  was  more  dependent  on  peak  frequency  (pitch 
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effect)  than  was  the  detection  of  a  change  in  bandwidth.  They  also 
reported  that  "the  listeners  required  less  training  and  exhibited  higher 
sensitivity  (lower  thresholds)  for  the  detection  of  symmetry  than  bandwidth 
factor  peak  changes"  (p.47).  Vranic,  Versnel,  &  Shamma  (1993)  also 
presented  both  single  and  double  spectral  peaks,  but  did  not  report  a 
covariance  of  neural  responses  of  AI  neurons  in  ferret  cortex  with  the 
number  of  spectral  peaks.  An  expanded  report  concerning  tlie  effect  of 
single  and  double  peaks  on  psychophysical  discrimination  of  these  complex 
stimuli  would  be  revealing,  and  directly  comparable  to  the  present  study. 
Their  data  imply  that  spectral  shape  is  very  infomiative  to  the  nervous 
system,  and  that  many  parameter  of  complex  stimuli  provide 
discriminatory  cues. 

In  Figure  12,  the  average  minimum  envelope  positive  phase  shifts 
detected  across  the  ripple  densities  tested  -  0.5  to  8  ripples/octavc  -  are 
shown.  Also  shown  are  the  frequency  difference  limen  (dl)  values  for  4 
different  frequency  values:  1  kHz,  3  kHz,  6  kHz,  and  8  kHz,  plotted  against 
ripple  density.  As  for  formant  ratios,  frequency  difference 
limen/frequency  ratios  can  be  converted  to  a  logarithmic  scale,  and  tenned 
"peaks"  or  "events  per  octave",  or  "ripple  density".  'Hius,  as  can  be  seen, 
as  ripple  density  increases,  tlic  number  of  event.s/octave  that  the  frequency 
difference  limen  represents  also  increases.  It  should  be  noted  that  the  slope 
of  the  envelope  phase  shift  threshold  and  the  slopes  of  the  frequency 
difference  limen  functions  arc  generally  similar.  This  suggests  that  the 
discriminative  functions  involved  in  frequency  discrimination  and  in  ripple 
envelope  shift  discrimination  may  be  similar.  However  as  was  earlier 
pointed  out ,  the  frequency  dl  for  a  frequency  of  1.38  kHz  (tlte  center 
frequency  of  the  human  ripple  stimulus)  is  approximately  2.5  Hz,  while  the 
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smallest  discriminable  phase  shift  for  ripple  stimuli  observed  in  this  study 
was  equivalent  to  a  frequency  value  of  approximately  4  Hz.  Thus,  the 
values  do  not  coincide. 


The  critical  band  ratio  converted  to  ripple  density  is  also  shown  in 
Figure  12.  Its  value  is  approximately  4  ripples/octave,  and  corresponds  to 
the  transition  to  increased  thresholds,  suggestive  of  an  interference  of  the 
filter  (critical  band)  spacing  with  the  spacing  of  major  spectral  peaks  in  the 
stimulus.  7  bus  peaks  of  spacings  of  less  than  4  per  octave  would  span  the 
critical  band  border,  and  discriniinability  of  envelope  phase  shifts  within 
tlie  range  from  0..5  to  approximately  4  ripples/octave  would  be  good  and 
fairly  constant,  as  it  appears  to  be.  Peaks  spaced  at  4  per  octave  or  more 
would  fall  witliin  single  critical  bands;  therefore  envelope  peak  shifts 
would  not  be  as  easily  discriminable,  and  the  threshold  should  increase. 

It  can  be  observed  tliat  tlie  minimum  envelope  phase  shift  function 
decreases  after  a  peak  at  about  5  ripples/octave.  There  is  an  increase  again 
at  7  ripples/octave  and  tlien  anotlier  decrease  at  8  ripples/octave.  All 
subjects  showed  a  decrease  in  threshold  after  a  peak  around  4  or  5 
ripples/octave.  Some  did  not  show  a  second  later  increase  in  threshold. 
Why  should  tlie  tlircshold  decrease  after  approximately  5  ripples/octave? 
As  mentioned,  some  subjects  showed  a  second  peak  in  thre.shold,  while 
others  did  not.  It  may  be  that  there  is  a  second  interference  witli  peak 


spacing  discrimination  related  to  a  function  of  the  critical  band. 


Studies  by  Schreiner,  Calhoun,  &  Keeling  (1993)  have  shown  that 


M 


untrained"  cats  have  tlie  best  neural  resi.-onse  to  ripple  densities  between  1 


and  2  ripples/octave.  7’his  implies  that  tlicre  may  be  something  inliercntly 


biologically  appealing  about  tlie  octave  scale,  in  tliat  one  event  per  octave, 
or  one  event  for  every  doubling  of  frequency,  facilitates  discrimination. 
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With  the  ci'tical  band  being  approximately  one-quarter  to  one-third  octave 
wide,  one  might  expect  that  spectral  peak  spacings  of  less  than  3  or  4  would 
be  fairly  easily  discriminated,  since  the  peaks  fell  into  different  critical 
bands.  It  is  also  the  case  that  most  spacings  of  formants  in  English  vowels 
are  less  than  4  peaks  per  octave.  Thus  animal  electrophysiological  data  and 
the  human  psychophysical  data  indicate  that  formants  arc  located  within  a 
range  of  spacing  values  that  coincide  with  maximally  discriminable 
perceptual  information  elements. 

Infant  studies  have  shown  that  the  young  infant  has  a  very 
sophisticated  ability  to  perceive  the  basic  sound.s  of  language  (Eimas  & 
Tartter,  1979).  Infants  even  below  the  age  of  4  months  are  able  to  make 


discriminations  among  steady-state  vowels.  Trehub  (1973)  showed  that  [a] 
could  be  distinguished  from  [i]  whether  in  isolation  or  in  the  context  of  a 
preceding  stop  consonant.  Infants  could  also  distinguish  [i]  from  [u], 
Swoboda,  Morse,  &  Leavitt  (1976)  showed  that  the  vowel  contrast  [i]  vs  [I], 
as  in  "beet  "vs  "bit"  was  discriminable  by  infants.  Trehub,  Endman,  & 
Thorpe  (1990)  presented  complex  stimuli  tliat  differed  in  spectral  structure 
to  7  to  8.5  month-old  infants.  Infants  successfully  differentiated  two 
spectral  structures,  even  when  they  varied  in  fundamental  frequency, 
intensity,  or  duration.  It  was  concluded  that  infants  classify  tonal  stimuli 
on  tlie  basis  of  timbre.  Similarly,  Clarkson,  Clifton,  &  Perris  (1988) 


presented  sounds  that  differed  in  their  spectral  cnvelope.s  to  7=month-nld 


infants,  who  successfully  discriminated  the  sounds.  They  point  out  that 


vowels  have  multidimensional  qualities  which  distinguish  tliem 


individually;  tliey  argue  that  timbre  is  a  multidimensional  attribute,  and 


point  to  evidence  that  shows  that  "adults'  perception  of  tonal  complexes 
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reveals  that  the  spectral  envelope  is  the  most  important  determinant  of 
timbre  (Plomp,  1970)  '  (p.20). 

Thus,  the  ability  to  process  vowel  differences  based  upon  changes  in 
spectral  envelope  seems  to  be  an  innate  ability  in  humans.  From  the  animal 
behavioml  data  (Schreiner  et  al.  1993),  discriminability  of  changes  in 
spectral  envelopes  is  also  present  in  cats.  It  may  be  a  basic  manrunalian 
auditor^'  processing  capacity. 
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Figure  1.  Ripple  Density  for  Fi/F2  and  for  F2/F3.  Values 
range  from  0.3  to  4.5  ripples  per  octave,  with  the  majority 
below  3  ripples/octave. 
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Ripple  Stimulus 

(centered  at  1.38  kHz,  3  octaves  wide, 
ripple  density  =  1  ripple/octave) 
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shifted  45  degrees.  As  can  be  seen,  the  frequency  values  of  the 
peaks  are  shifted. 
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Figure  4.  Performance  of  subject  dk  on  a  ripple  stimulus  of 
ripple  density  a)  1  ripple/octave  and  b)  4  ripples/octave.  The 
thresholds  for  a  ripple  density  of  1  ripple/octave  were  +2.3  and 
>4.7  degrees.  The  thresholds  for  a  ripple  density  of  4 
ripples/octave  were  +13.5  and  -22.5  degrees. 
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Figure  5.  Performance  of  subjects  across  ripple  densities,  for 
positive  envelope  phase  shifts. 
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Figure  6.  Performance  of  subjects  across  ripple  densities,  for 
negative  envelope  phase  shifts. 
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Figure  7.  Thresholds  pooled  for  all  subjects  a)  for  positive 
phase  shifts  and  b)  for  negative  phase  shifts,  showing  means  and 
standard  deviations  at  each  ripple  density. 
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Figure  8.  Thresholds  for  ramped  and  unramped  stimulus 
conditions  for  two  subjects.  Some  subjects  may  use  the  edge  of 
the  spectrum  in  their  discrimination  processing. 
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Figure  9.  Threshold  as  a  function  of  depth  of  modulation  (two 
subjects).  Two  subjects  were  tested  across  four  ripple  densities 
at  six  different  depths  of  modulation. 
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Figure  10.  Comparison  of  the  regression  lines  for  the  two 
groups  of  depth  of  modulation  for  the  two  subjects'  combined 
values.  Below  a  depth  of  modulation  value  of  7.5  dB,  the 
threshold  for  detecting  ripple  stimulus  envelope  phase  shifts 
increases  dramatically. 
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Figure  11.  Threshold  at  various  intensities  (three  subjects). 
Threshold  does  not  appear  to  be  strongly  dependent  on  intensity 
over  the  tested  range. 
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Figure  12.  Frequency  Difference  Limen  re  Envelope  Phase 
Shift.  Human  thresholds  plotted  against  rippie  density, 
frequency  difference  limens,  and  critical  band  density. 
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t 


ripple  density 

mean  threshold 

shift  standard  deviation 

0,5 

7.3 

4.4 

1 

7 

3.6 

1.5 

5.9 

2.5 

2 

9 

4.2 

2.5 

8.7 

5 

3 

14.2 

5.9 

3.5 

11 

3.9 

4 

20.5 

6.9 

4.5 

25.5 

8.4 

5 

25.5 

10.6 

6 

13.9 

7.1 

n 

t 

21J 

5.3 

8 

11.7 

4,7 

Table  1.  Mean  threshold  shift  and  standard  deviation  at  each 
rip;>le  de.isity  for  positive  phase  shifts  (N=6) 
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ripple  density 

mean 

threshold  shift  standard  deviation 

0.5 

13.5 

3.4 

1 

8.6 

4 

1.5 

7.8 

1.8 

2 

12.4 

5.3 

2.5 

14.4 

5.4 

3 

18.3 

10 

2.5 

15.1 

6.9 

4 

24.7 

5.6 

4.5 

15.6 

4.5 

5 

14.8 

4.8 

6 

16.6 

5 

7 

15.3 

4 

8 

11.9 

7.1 

Table  II.  Mean  threshold  shift  and  standard  deviation  at  each 
ripple  density  for  negative  phase  shifts  (N=6) 
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mean  threshold 

standard  deviation 

rd=l 

7.5 

2.4 

rd=3 

10.7 

2.4 

rd=5 

14.8 

3.8 

Table  III.  Means  and  standard  deviations  across  intensities  for 
subject  kk. 


mean  threshold 

standard  deviation 

rd~l 

3.3 

1.5 

rd=3 

9.5 

2.0 

rd=5 

20.9 

4.8 

Table  IV. 
subject  dk. 

Means  and  standard  deviations 

across  intensities 

mean  threshold 

standard  deviation 

rd=l 

4.5 

1.9 

II 

5.9 

3.0 

rd=5 

9.4 

4.6 

Table  V.  Means  and  standard  deviations  across  intensities  for 
subject  be. 
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ripple  density 

mean  threshold  (in 
degrees) 

difference  in  frequency 
(in  Hz) 

0.5 

7.3 

-38.25 

1 

7 

-18.47 

1.5 

5.9 

-10.41 

2 

9 

-11.91 

2.5 

8.7 

-9.22 

3 

14.2 

-12.52 

3.5 

11 

-8.33 

4 

20.5 

-13.55 

4.5 

25.5 

-14.97 

5 

25.5 

-13.48 

6 

13.9 

-6.14 

7 

21.1 

-7.99 

8 

11.73 

-3.89 

Table  VI.  Frequency  differences  corresponding  to  threshold 
positive  phase  shifts. 
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npple  density 

mean  threshold  (in 
degrees) 

difference  in  frequency 
(in  Hz) 

0.5 

13.5 

73.64 

1 

8.6 

23.04 

1.5 

7.8 

13.89 

2 

12.4 

16.57 

2.5 

14.4 

15.39 

3 

18.3 

16.3 

3.5 

15.1 

11.51 

4 

24.7 

16.51 

4.5 

15.6 

9.24 

5 

14.8 

7.89 

6 

16.6 

7.37 

7 

15.3 

5.82 

8 

11.9 

3.96 

Table  VII.  Frequency  differences  corresponding  to  threshold 
negative  phase  shifts. 
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Flanagan  study 


Ripple  study 


Ripple  Density 
(ripples/octave) 

Average 
Frequency  DL 

Threshold 
Phase  shift 

0.5 

55  Hz 

10.1  degrees 

1.0 

55  Hz 

20.3  degrees 

0.5 

54  Hz 

10.4  degrees 

1.0 

21  Hz 

7.8  degrees 

Table  VIII.  Comparison  of  Flanagan  DL  values  and  ripple 
study  threshold  phase  shift  values. 


